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Description 

[0001] This application is a continuation-in-part of U.S. Patent Application Serial No. 07/872,643 filed on April 17, 
1992. 

FIFI D OF THE INVENTION 

[0002] The present invention relates, in general, to materials and methods relevant to cell-cell adhesion. More par- 
ticularly, the invention relates to novel Ca 2+ -dependent cell adhesion proteins, referred to as cadherins, and to polynu- 
cleotide sequences encoding the cadherins. The invention also relates to methods for inhibiting binding of the cadherins 
to their natural ligands/antiligands. 

BACKGROUND 

[0003] In vivo, cell-cell adhesion plays an important role in a wide range of events including morphogenesis and 
organ formation, leukocyte extravasion, tumor metastasis and invasion, and the formation of cell junctions. Additionally, 
cell-cell adhesion is crucial for the maintenance of tissue integrity, e.g., of the intestinal epithelial barrier, of the blood 
brain barrier and of cardiac muscle. 

[0004] Intercellular adhesion is mediated by specific cell adhesion molecules. Cell adhesion molecules have been 
classified into at least three superfamilies including the immunoglobulin (Ig) superfamily, the integrin superfamily and 
the cadherin superfamily. All cell types that form solid tissues express some members of the cadherin superfamily sug- 
gesting that cadherins are involved in selective adhesion of most cell types. 

[0005] Cadherins have been generally described as glycosylated integral membrane proteins that have an N-termi- 
nal extracellular domain that determines binding specificity (the N-terminal 113 amino acids appear to be directly 
involved in binding), a hydrophobic membrane-spanning domain and a C-terminal cytoplasmic domain (highly con- 
served among the members of the superfamily) that interacts with the cytoskeleton through catenins and other cytoskel- 
eton-associated proteins. Some cadherins lack a cytoplasmic domain, however, and appear to function in cell-cell 
adhesion by a different mechanism than cadherins that have a cytoplasmic domain. The cytoplasmic domain is required 
for the binding function of the extracellular domain in cadherins that do have a cytoplasmic domain. Binding between 
members of the cadherin family expressed on different cells is mainly homophilic (i.e., a member of the cadherin family 
binds to cadhering of its own or a closely related subclass) and Ca 2+ -dependent. For recent reviews on cadherins, see 
Takeichi, Annu. Rev. Biochem., 59:237-252 (1990) andTakeichi, Science, 251, 1451-1455 (1991). 
[0006] ' The first cadherins to be described (E-cadherin in mouse epithelial cells, L-CAM in avian liver, uvomorulin in 
the mouse blastocyst, and CAM 120/80 in human epithelial cells) were identified by their involvement in Ca 2+ -depend- 
ent cell adhesion and by their unique immunological characteristics and tissue localization. With the later immunological 
identification of N-cadherin, which was found to have a different tissue distribution from E-cadherin, it became apparent 
that a new family of Ca 2+ -dependent cell-cell adhesion molecules had been discovered. 

[0007] The molecular cloning of the genes encoding mouse E- [see Nagafuchi era/., Nature, 329: 341-343 (1987)], 
chicken N- [Hatta ef a/., J.Celt Biol., 106: 873-881 (1988)], and mouse P-[Nose era/., EMBOJ. 6\ 3655-3661 (1987)] 
cadherins provided structural evidence that the cadherins comprised a family of cell adhesion molecules. Cloning of 
chicken L-CAM [Gailin et al., Proc. Natl. Acad Sci. USA, 84: 2808-2812 (1987)] and mouse uvomorulin [Ringwald et 
a/., EMBO /., 6; 3647-3653 (1987)] revealed that they were identical to E-cadherin. Comparisons of the amino acid 
sequences of E-, N-, and P-cadherins showed a level of amino similarity of about 45%-58% among the three sub- 
classes. Liaw era/., EMBOJ., 9: 2701-2708 (1990) describes the use of PCR with degenerate oligonucleotides based 
on one conserved region of E-, N-and P-cadherins to isolate N- and P-cadherin from a bovine microvascular endodthe- 
lia! cell cDNA. The Liaw et al., supra, results implied that there were only E-, N-, and P-cadherins because no new cad- 
herins were identified. Also in 1990, it was imported in Heimark ef a/., J. Cell Biol., 110: 1745-1756 (1990) that an 
antibody generated to bovine aortic endothelial cells recognized an intercellular junctional molecule designated V-cad- 
herin which had a similar molecular weight to known cadherins and was able to inhibit Ca 2+ -dependent cell endothelial 
cell adhesion. The article did not disclose any sequence information for the protein recognized by the antibody 
[0008] No further cadherin genes were described until the identification of eight of the novel cadherins claimed 
herein was reported in Suzuki etal., Cell Regulation, 2: 261-270 (1991). Subsequently, several other cadherins were 
described including chicken R-cadherin [Inuzuka etal., Neuron, 7: 69-79 (1991)], mouse M-cadherin [Donalies etal, 
Proc. Natl. Acad. Sci. USA, 88: 8024-8028 (1991)], chicken B-cadherin [Napolitano etal., J. Cell. Biol., 113: 893-905 
(1991)], and T-cadherin [chicken in Ranscht etal, Neuron, 7: 391-402 (1991) and chicken and human in Patent Coop- 
eration Treaty (PCT) International Publication No. WO 92/08731 published on May 29, 1992]. 

[0009] The determination of the tissue expression of the various cadherins reveals that each subclass of cadherins 
has a unique tissue distribution pattern. For example, E-cadherin is found in epithelial tissues while N-cadherin is found 
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in nonepithelial tissues such as neural and muscle tissue. The unique expression pattern of the different cadherins is 
particularly significant when the role each subclass of cadherins may play in vivo in normal events (e.g., the mainte- 
nance of the intestinal epithelial barrier) and in abnormal events (e.g., tumor metastatis or inflammation) is considered. 
Supression of cadherin function has been implicated in the progression of various cancers. See Shimoyama era/., Can- 

5 cer Res., 52: 5770-5774 (1992). Different subclasses or combinations of subclasses of cadherins are likely to be 
responsible for different cell-cell adhesion events in which therapeutic detection and/or intervention may be desirable. 
Studies have also suggested that cadherins may have some regulatory activity in addition to adhesive activity. Matsu- 
naga etal, Nature, 334, 62-64 (1988) reports that N-cadherin has neurite outgrowth promoting activity and Mahoney et 
aL, Cell, 67, 853-868 (1991) reports that the Drosophila fat tumor supressor gene, another member of the cadherin 

10 superfamily, appear to regulate cell growth. Expression of the cytoplasmic domain of N-cadherin without its extracellular 
domain has been shown in Kintner et aL, Cell, 69: 229-236 (1992) to disrupt embryonic cell adhesion and in Fugimori 
etal., MoL Biol. Cell, 4: 37-47 (1993) to disrupt epithial cell adhesion. Thus, therapeutic intervention in the regulatory 
activities of cadherins expressed in specific tissues may also be desirable. 

[0010] There thus continues to exist a need in the art for the identification and characterization of additional cad- 
is herins participating in cell-cell adhesion and/or regulatory events. Moreover, to the extent that cadherins might form the 
basis for the development of therapeutic and diagnostic agents, it is essential that the genes encoding the proteins be 
cloned. Information about the DNA sequences and amino acid sequences encoding the cadherins would provide for the 
large scale production of the proteins and for the identification of the cells/tissues naturally producing the proteins, and 
would permit the preparation of antibody substances or other novel binding molecules specifically reactive with the cad- 
20 herins that may be useful in modulating the natural ligand/antiligand binding reactions in which the cadherins are 
involved. 

SUMMARY OF THE INVENTION 

25 [0011] The present invention provides materials and methods that are relevant to cell-cell adhesion. In one of its 
aspects, the present invention provides purified and isolated polynucleotide sequences (e.g., DNA and RNA, both 
sense and antisense strands) encoding novel cadherins, cadherin-4 through -1 2. Preferred polynucleotide sequences 
of the invention include genomic and cDNA sequences as well as wholly or partially synthesized DNA sequences, and 
biological replicas thereof (i.e., copies of purified and isolated DNA sequences made in vivo or in vitro using biological 

30 reagents). Biologically active vectors comprising the polynucleotide sequences are also contemplated. 

[0012] The scientific value of the information contributed through the disclosures of the DNA and amino acid 
sequences of the present invention is manifest. For example, knowledge of the sequence of a cDNA encoding a cad- 
herin makes possible the isolation by DNA/DNA hybridization of genomic DNA sequences that encode the protein and 
that specify cadherin-specific expression regulating sequences such as promoters, enhancers and the like. DNA/DNA 

35 hybridization procedures utilizing the DNA sequences of the present invention also allow the isolation of DNAs encod- 
ing heterologous species proteins homologous to the rat and human cadherins specifically illustrated herein. 
[001 3] According to another aspect of the invention, host cells, especially eucaryotic and procaryotic cells, are sta- 
bly transformed or transfected with the polynucleotide sequences of the invention in a manner allowing the expression 
of cadherin polypeptides in the cells. Host cells expressing cadherin polypeptide products, when grown in a suitable cul- 

40 ture medium, are particularly useful for the large scale production of cadherin polypeptides, fragments and variants; 
thereby enabling the isolation of the desired polypeptide products from the cells or from the medium in which the cells 
are grown. 

[0014] The novel cadherin proteins, fragments and variants of the invention may be obtained as isolates from nat- 
ural tissue sources, but are preferably produced by recombinant procedures involving the host cells of the invention. 

45 The products may be obtained in fully or partially glycosylated, partially or wholly de-glycosylated or non-glycosylated 
forms, depending on the host cell selected or recombinant production and/or post- isolation processing. 
[001 5] Cadherin variants according to the invention may comprise polypeptide analogs wherein one or more of the 
specified (i.e., naturally encoded) amino acids is deleted or replaced or wherein one or more nonspecified amino acids 
are added: (1) without loss, and preferably with enhancement, of one or more of the biological activities or immunolog- 

50 ical characteristics specific for a cadherin; or (2) with specific disablement of a particular ligand/antiligand binding func- 
tion of a cadherin. 

[0016] Also contemplated by the present invention are antibody substances [e.g., monoclonal and polyclonal anti- 
bodies, chimeric and humanized antibodies, and antibody domains including Fab, Fab' and F(ab') 2 , single chain anti- 
bodies, and Fv or single variable domains] and other binding proteins or peptides specifically react with cadherins of 
55 the invention. Antibody substances can be developed using isolated natural, recombinant or synthetic cadherin 
polypeptide products or host cells expressing such products on their surfaces. The antibody substances may be utilized 
for purifying polypeptides of the invention, for determining the tissue expression of the polypeptides and as antagonists 
of the ligand/antiligand binding activities of the cadherins. Specifically illustrating antibody substances of the invention 
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are the monoclonal antibodies produced by the hybridomas designated 30Q8A, 30Q4H, 45A5G, 30S2F and 45C6A 
which were all deposited with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockvilie, Maryland 
20852 on April 6, 1993 and were respectively assigned ATCC Deposit Nos. HB11316, HB1 1317, HB1 1318, HB1 1319 
and HB1 1 320. Also illustrating antibody substances of the invention is the monoclonal antibody produced by the hybri- 
doma designated 30T11G which was deposited with the ATCC on April 8, 1993 and was assigned ATCC Deposit No. 
HB11324. 

[0017] The DNA and amino acid sequence information provided by the present invention makes possible the sys- 
tematic analysis of the structure and function of the cadherins described herein and definition of those molecules with 
which the cadherins will interact on extracellular and intracellular levels. The idiotypes of anti-cadherin monoclonal anti- 
bodies of the invention are representative of such molecules and may mimic natural binding proteins (peptides and 
polypeptides) through which the intercellular and intracellular activities of cadherins are modulated. Alternately, they 
may represent new classes of modulators of cadherin activities. Anti-idiotypic antibodies, in turn, may represent new 
classes of biologically active cadherin equivalents. 

[0018] Methods for modulating cadherin activity may involve contacting a cadherin with an antibody (or antibody 
fragment), another polypeptide or peptide ligand (including peptides derived from cadherins or other proteins, or a novel 
peptide), or a small molecule ligand that specifically binds to a portion (extracellular or cytoplasmic) of the cadherin. 
[0019] ' Numerous aspects and advantages of the present invention will be apparent upon consideration of the fol- 
lowing detailed description thereof, reference being made to the drawing wherein: 

FIGURE 1 is a bar graph illustrating the binding of polymorphonuclear neutrophils and T cells to fusion proteins 
comprising extracellular subdomains of cadherin-5. 

DETAILED DESCRIPTION 

25 [0020] The present invention is illustrated by the following examples wherein Example 1 describes the isolation of 
cDNA sequences encoding rat cadherins-4 through -11 and -1 3; Example 2 describes the isolation of cDNA sequences 
encoding the human homologs of rat cadherins-4, -5, -6, -8, -10, -1 1 and -13 and the isolation of a human cadherin not 
identified in rat, cadherin-12; Example 3 characterizes the relationship of cadherins of the invention to previously iden- 
tified cadherins in terms of amino acid sequence and structure. The generation of polyclonal and monoclonal antibodies 

30 specific for cadherins of the invention is described in Example 4. Example 5 describes the construction of expression 
constructs comprising cadherin-4, -5 and -8 sequences, transfection of mammalian cells with the constructs and results 
of cell-cell adhesion assays performed with the transfected cells. Example 6 presents the results of assays for cadherin 
mRNA and protein expression in various mammalian tissues, cells and cell lines. The results of in vitro transendothehal 
migration assays involving cadherin-5 and assays of neutrophil and T-cell binding to cadherin-5 fusion protein are 

35 described in Example 7. Example 8 describes expression of cadherin-5 in the blood-brain barrier and Example 9 
describes cadherin-5 peptides that are capable of increasing endothelim permeability. Example 1 0 describes the asso- 
ciation of the cytoplasmic domain of cadherin-5 with plakoglobin. The disclosures of Suzuki et a/., Cell Regulation, 
supra; Suzuki era/., J. Cell. Biol., 115, Abstract 72a (1991); Suzuki et a/., Cell. Struc. Fund, 16, 605 (1991); andTani- 
hara etal., Invest Ophthalmol. Vis. ScL, 32, 1013 (1991) are incorporated by reference herein for purposes of illustrat- 

40 ing the background of the invention. 

Example 1 

[0021] Partial cDNA clones encoding nine novel cadherins were isolated from rat brain and retina by PCR. Eight of 
the novel rat cadherin cDNAs were isolated using degenerate PCR primers based on highly conserved regions of the 
cytoplasmic domain of known cadherins and one was isolated using degenerate PCR primers based on moderately 
conserved regions of the extracellular domain of known cadherins. 
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A Preparation of Rat cDNA 

[0022] Total RNAs were prepared from rat brain by the guanidium isothiocyanate/cesium chloride method 
described in Maniatis etal., pp. 196 in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York: Cold 
Spring Harbor Laboratory (1982). Brain poly(A) + RNAs were then isolated using an Invitrogen (San Diego, CA) Fast- 
Track kit. Rat retina poiy(A) + RNA was purchased from Clonetech (Palo Alto, CA). cDNA was synthesized from the 
poly(A) + RNA of both rat brain and retina using a cDNA synthesis kit (Boehringer Mannheim Corporation, Indianopolis, 
IN). 
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B. Design and Synthesis of PCR Primers 
Corresponding to Cadherin Cytoplasmic Domain 

[0023] A first pair of degenerate oligonucleotide primers, listed below in IUPAC nomenclature, was designed to cor- 
5 respond to highly conserved sequences in the cytoplasmic domain of mouse N-, E-, and P-cadherins. Underlined 
sequences at the end of each oligonucleotide indicate an EcoR1 site added to the primers to facilitate cloning of the 
fragments generated by PCR. 

Degenerate Primer 1 

w 

TAPPYD (SEQIDNO: 1) 

5' GAATTCACNGCNCCNCCNTAYGA 3' (SEQ ID NO: 2) 
Degenerate Primer 2 

75 

FKKLAD (SEQ ID NO: 3) 

3' AARTTYTTYRANCGNC TCTTAAG 5' (SEQ ID NO: 4) 

The degenerate oligonucleotides were synthesized using the Applied Biosystems mode! 380B DNA synthesizer (Foster 
20 City, CA). 

C. Design and Synthesis of PCR Primers 
Corresponding to Cadherin Extracellular Domain 

25 [0024] A second pair of degenerate oligonucleotide primers, listed below in IUPAC nomenclature, was designed to 
correspond to moderately conserved sequences in the third subdomain of the extracellular domain of mouse N-, E-, 
and P-cadherins. The extracellullar domains of the mouse N-, E- and P-cadherins have been characterized as having 
five internal subdomains, some of which may be involved in cadherin interaction with Ca 2+ . Underlined sequences at 
the end of each oligonucleotide indicate an EcoR1 site added to the primers, to facilitate cloning of the fragments gen- 

30 erated by PCR. 

Degenerate Primer 3 

K(P/G)(LYI/V)D(F/Y)E (SEQ ID NO: 5) 
35 5' GAATTCAARSSNNTNGAYTWYGA 3' (SEQ ID NO: 6) 

Degerenate Primer 4 

(N/D)E(A/P)PXF (SEQ ID NO: 7) 
40 3' TRCTY SGNGGNNNNAARCT3MG 5' (SEQ ID NO: 8) 

D. Cloning of cDNA Encoding Eight Novel Rat Cadherins 

[0025] PCR amplification reactions of rat brain and retina cDNA were carried out either with degenerate primers 1 
45 and 2 or with degenerate primers 3 and 4 under conditions essentially the same as those described in Saiki et at., Sci- 
ence, 239, 487-491 (1988). Briefly, 100 ng of brain or retina first strand cDNA was used as template for amplification by 
Taq DNA polymerase (International Bioltechnology, New Haven, CT) using 10 jig of each primer set per reaction. PCR 
reactions were initiated by adding 2 units of Taq DNA polymerase to the reaction solution, after which 35 PCR reaction 
cycles were carried out. Reaction cycles consisted of denaturation performed at 94°C for 1 .5 minutes, oligonucleotide 
so annealing at 45°C for 2 minutes, and elongation at 72°C for 3 minutes. The resulting PCR fragments were separated by 
agarose gel electrophoresis, and DNA bands of the expected size were extracted from the gel and digested with 
EcoR1. The fragments were then cloned into the M 13 vector (Boehringer Mannheim Corp., Indianapolis, IN) and E. colt 
JM101 cells were transformed with the resulting constructs. Individual clones were then isolated and sequenced. 
Sequencing of the DNAs was carried out using a sequenase kit (United States Btochemicals, Cleveland, OH) and the 
55 resulting DNA and deduced amino acid, sequences of the clones were compared to sequences of known cadherins 
using the Microgenie program (Beckman, Fullerton, CA). 

[0026] Ten representative cDNA clones encoding cadherins were identified from the PCR reaction based on 
degenerate primers 1 and 2. Two clones corresponded to rat N-, and E-cadherins, but eight clones encoded previously 
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undescribed cadherins, and were designated cadherins-4 through -11. The DNA and deduced amino ac.d sequences 
of the eight rat cytoplasmic domain cDNA clones are respectively set out in SEQ ID NOs: 9 and 10 (cadhenn-4) SEQ 
ID NOs 1 1 and 12 (cadherin-5), SEQ ID NOs: 13 and 14 (cadherin-6), SEQ ID NOs: 15 and 16 (cadher.n-7), SEQ ID 
NOs: 17 and 18 (cadherin-8), SEQ ID NOs: 19 and 20 (cadherin-9), SEQ ID NOs: 21 and 22 (cadhenn-10) and SEQ 
ID NOs: 23 and 24 (cadherin-11). 

[0027] An additional novel cadherin was identified from the PCR reaction based on degenerate primers 3 and 4, 
and it was designated cadherin-1 3. The DNA and deduced amino acid sequences of the rat cadherm-1 3 fragment are 
respectively set out in SEQ ID NOs: 25 and 26. 

[0028] The PCR reaction based on degenerate primers 3 and 4 also amplified sequences which were later deter- 
mined to be fragments of the extracellular domains of rat cadherins-4, -5, -6, -8, -9, -10, -11 and -13. The DNA and 
amino acid sequences of these extracellular fragments are respectively set out in SEQ ID NOs: 27 and 28 (cadhenn- 
4) SEQ ID NOs: 29 and 30 (cadherin-6), SEQ ID NOs: 31 and 32 (cadherin-8), SEQ ID NOs: 33 and 34 (cadherm-9 , 
SEQ ID NOs- 35 and 36 (cadherin-1 0), SEQ ID NOs: 37 and 38 (cadherin-11), SEQ ID NOs: 39 and 40 (cadher.n-1 3). 
[00291 Larger cadherin-8 and -1 0 cDNAs were isolated from a rat brain cDNA library made in Uni-ZAP vector (Strat- 
agene La Jolla, CA) using labelled cadherin-8 extracellular domain PCR fragment (SEQ ID NO: 17) or cadhenn-10 
extracellular domain fragment (SEQ ID NO: 21) as probes. Two types of cadherin-8 cDNA clones were isolated. The 
first type encodes a full length cadherin, but the second type encodes a truncated protein the sequence of which 
diverges from the first type of cadherin-8 clone near the N-terminus of the fifth extracellular subdomain (EC5) The trun- 
cated clone contains a short stretch of unique sequence in the N-terminus of ECS but lacks the remainder of EC5 the 
transmembrane domain and the cytoplasmic domain. DNA and deduced amino acid sequences of the full length clone 
are respectively set out in SEQ ID NOs: 41 and 42 and the DNA and deducted amino acid sequences of the truncated 
cadherin-8 clone are set out in SEQ ID NOs: 43 and 44. The cadherin-10 cDNA clone that was isolated has an open 
reading frame which begins at a region corrsponding to the middle of the first extracellular domain (EC1) of Previously 
identified cadherins. The DNA and deduced amino acid sequences of the cadherin-1 0 clone are set out in SEQ ID NOs: 
45 and 46. 

Example 2 

[00301 Full length cDNAs encoding human homologs of rat cadherins-4, -8, -1 1 and -1 3 and partial cDNAs encod- 
ing human homologs of rat cadherins-6 and -10 were isolated from a human fetal brain cDNA library (XZapll vector 
Stratagene). A full length cDNA encoding a human homolog of rat cadherin-5 was isolated from a human placental 
cDNA library (A.gt1 1 vector, Dr. Millan, La Jolla Cancer Research Foundation, La Jolla, CA). 

[0031] Probes for screening the human fetal brain and placental cDNA libraries were amplified by PCR from human 
brain cDNA (Dr. Taketani, Kansain Medical University, Moriguchi, Osaka, Japan) using the primers described in Exam- 
ple 1 B-C Probes consisting of human cadherin-4, -5, -6, -8, -10 and -1 1 sequences were generated using degenerate 
primers 1 and 2 and probes consisting of human cadherin-13 sequence were generated using degenerate primers 3 
and 4. Amplification of the human fetal brain cDNA with degenerate primers 3 and 4 also generated a PCR fragment 
encoding a cadherin not isolated from rat, designated cadherin-1 2. 

[0032] PCR fragments encoding human cadherins-4, -5, -6, -8, -1 0, -1 1 , -12 and -13 were labeled with P and 
used to probe the human fetal brain and placental cDNA libraries according to the plaque hybridization method 
described in Ausubel et al., Eds., Current Protocols in Molecular Biology, Sections 6.1.1 to 6.1.4 and 6.2.1 to 6.2.3, 
John Wiley & Sons, New York (1987). Positives were plaque-purified and inserts were cut out using an in vivo excision 
method The inserts were then subcloned into the M1 3 vector (Boehringer Mannheim) for sequencing. 
[0033] Inserts consisting of full length cDNAs encoding human homologs of rat cadherins-4, -8, -1 1 , -1 2 (putative) 
and -13 and partial cDNAs encoding human homologs of rat cadherins-6 and -10 were identified in clones from the 
human fetal brain cDNA library and a full length cDNA encoding a human homolog of rat cadherin-5 was identified in a 
clone from the human placental cDNA library. The DNA and deduced amino acid sequences of the human home .logs 
are respectively set out in SEQ ID NOs: 47 and 48 (cadherin-4), SEQ ID NOs: 49 and 50 (cadherin-5) SEQ ID NOs: 
51 and 52 (cadherin-6), SEQ ID NOs: 53 and 54 (cadherin-8), SEQ ID NOs: 55 and 56 (cadherin-10), SEQ ID NOs: 57 
and 58 (cadherin-11), SEQ ID NOs: 59 and 60 (cadherin- 12), and SEQ ID NOs: 61 and 62 (cadhenn-13). 



Example 3 



[0034] Comparison of the full-length sequences of the novel human cadherins described in Examples 1 and 2 with 
sequences of previously described cadherins and cadherin-related proteins provides support for the proposal that cad- 
herins can be divided into at least three subgroups based on amino acid sequence identity and/or domain structure. 
Identity values for one possible alignment of the sequences of the extracellular domains of selected human cadherins 
are presented in Table 1 below. 
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20 [0035] Based on such sequence alignments and on the fact that certain combinations of cadherin sequences seem 
to have conserved stretches of amino acids when aligned, one subgroup of cadherins may include E-cadherin, N-cad- 
herin, P-cadherin and cadherin-4, while a second subgroup may include cadherin-5, cadherin-8, cadherin-1 1 and cad- 
herin-1 2. Cadherins-6, -7, -9 and -10 may also be included with the second subgroup based on their partial amino acid 
sequences disclosed herein. The amino acid sequence of cadherin-4 exhibits especially high amino acid sequence 

25 identity with that of R-cadherin (92%), indicating that cadherin-4 may be the human homolog of chicken R-cadherin. All 
cadherins in these two subgroups have a similar structure. Following an initiation codon, each has a signal sequence, 
prosequence, proteolytic cleavage site of precursor protein, an extracellular domain (which comprises five subdomains 
EC1-5), a transmembrane sequence and a cytoplasmic domain. For cadherin-5, these sequences/domains appear to 
correspond to about the following amino acid positions of SEQ ID NO: 50: 1-24 (signal sequence), 25-43 (prose- 

30 quence), 44-147 (EC1), 148-254 (EC2), 255-368 (EC3), 369-475(EC4), 476-589 (EC5), 590-616 (transmembrane 
sequence) and 617-780 (cytoplasmic domain). 

[0036] Cadherin-1 3, T-cadherin and V-cadherin may be representative of a third subgroup of cadherins. Cadherin- 
13 consists of a cadherin-like extracellular domain, but has no domains that would correspond to the typical transmem- 
brane or cytoplasmic domains of other cadherins. Even though about 1 0% of the clones obtained by PCR using degen- 

35 erate primers 3 and 4 were cadherin-1 3 clones, none of the clones included sequences corresponding to a cytoplasmic 
domain. An attempt to isolate a cDNA that contained this region by PCR using a primer corresponding to the most C- 
terminal region of cadherin-1 3 available and a mixed oligonucleotide primer corresponding to a well-conserved amino 
acid sequence of the cytoplasmic domain of cadherins failed to generate any product with the anticipated molecular 
weight. A similar protein, T-cadherin, has been identified in chicken which also lacks the typical cadherin cytoplasmic 

40 domain. The amino acid sequence identity between the two molecules is about 80%. Cadherin-1 3 may be the human 
homologue of chicken T-cadherin or may be a closely related molecule. Human cadherin-1 3 and avian T-cadherin may 
also both be closely related to V-cadherin. A 29-amino acid amino terminal sequence of bovine V-cadherin is similar to 
the start of the precursor region of cadherin-1 3 (93%) and T-cadherin (79%). V-cadherin is a 135 KD protein which 
appears to be restricted in tissue distribution to endothelium. In constrast, mature T-cadherin has a molecular weight of 

45 95 KD and shows a wide tissue distribution. Both V-cadherin and T-cadherin are linked to the cell membrane through 
phosphoinositiol. 

Example 4 

so [0037] Polyclonal and/or monoclonal antibodies specific for cadherins of the invention were generated. 
A. Generation of Polyclonal Antibodies 

[0038] Bacterial fusion proteins consisting of maltose binding protein fused to portions of cadherin extracellular 
55 subdomains (either human cadherin-4, -5 or -1 1 , or rat cadherin-8) were generated and subsequently used for the gen- 
eration of polyclonal antibodies. 

[0039] A cDNA fragment corresponding to a 40 KD portion of the extracellular domain of human cadherin-5 (nucle- 
otides 535 to 1527 of SEQ ID NO: 49) was synthesized by PCR from the full-length human cadherin-5 cDNA described 
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in Example 2. The fragment was subcloned into the multicloning site (EcoR1 -Xba\) of the pMAL-RI plasmid vector [New 
England Biolabs Inc. (NEB), Beverly, MA]. The resulting construct encodes maltose binding protein fused to the extra- 
cellular domain of cadherin-5. Constructs encoding maltose binding protein fused to the three N-terminal subdomains 
of human cadherin-4, rat cadherin-8 and human cadherin-1 1 were generated by similar methods. 
[0040] E. coli NM522 ceils (Stratagene) were then transformed with one of the fusion protein constructs and grown 
in quantity. After disruption of E. coli cells, the individual fusion proteins were purified by affinity column chromatography 
using amylose resin (NEB) according to the instructions of the manufacturer. When subjected to SDS-PAGE, the puri- 
fied fusion proteins each showed essentially one band of the expected size. 

[0041] A total of five hundred ng of a fusion protein in Freund's complete adjuvant was injected into rabbits at four 
subcutaneous sites. Subsequent injections were carried out at three week intervals using 100 ug of the fusion protein 
in Freund's incomplete adjuvant also at four subcutaneous sites. The resulting polyclonal sera generated from immuni- 
zation of rabbits with cadherin-4, -5 or -8 fusion protein were collected and tested for specificity on L cells transfected 
with the appropriate cadherin sequence (see Example 5). Polyclonal serum generated from immunization of rabbits 
with cadherin-1 1 was also collected. 

[0042] Immunoblotting of various cell types showed that the The anti-cadherin-4 polyclonal serum reacts with pro- 
tein of about 130 KD in L cells transfected with full length cadherin-4 cDNA and in rat brain. Cadherin-5-specific serum 
reacts with a protein of about 1 35 KD in L cells transfected with a full length cadherin-5 DNA and with a protein of about 
1 35 KD in human umbilical vein endothelial cells (HUVEcs). The serum does not react with MDCK cells that expressed 
high levels of E-cadherin. In bovine aortic endothelial cells, the anti-cadherin-5 serum reacts with a protein of about 120 
KD. Additionally, the anti-cadherin-5 serum reacts with a protein which has the same molecular weight in rat brain 
endothelial cells in culture. The cadherin-8 polyclonal antibody detected a strong band of about 90 KD and a weak band 
of about 130 KD in rat brain. 

B. Generation of Monoclonal Antibodies Specific for Hu man Cadherin-5 

[0043] Monoclonal antibodies to cadherin-5 were prepared using bacterial fusion proteins containing subdomains 
of the extracellular domain of human cadherin-5 as immunogens. The fusion proteins prepared included maltose bind- 
ing protein and the extracellular subdomains 1-2 (EC1-2) or extracellular subdomains 2-4 (EC2-4) of cadherin-5 in the 
bacterial expression vector pMAL (NEB). The two fusion proteins were expressed in bacteria and purified on amylose- 
sepharose as described in foregoing section on generation of polyclonal antibodies. The purified fusion proteins were 
used separately to immunize mice at two subcutaneous sites (100 u.g of fusion protein per mouse in Freund's complete 
adjuvant). The mice then were subcutaneously immunized with Freund's incomplete adjuvant. 
[0044] The spleen from each mouse was removed sterility and treated in the same manner. Briefly, a single-cell 
suspension was formed by grinding the spleen between the frosted ends of two glass microscope slides submerged in 
serum free RPMI 1640 supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100 units/ml penicillin and 100 
mg/ml streptomycin (RPMI) (Gibco, Canada). The cell suspension was filtered through a sterile 70-mesh cell strainer, 
and washed twice by centrifuging at 200 g for 5 minutes and resuspending the pellet in 20 ml serum free RPMI. Thy- 
mocytes taken from 3 naive Balb/c mice were prepared in a similar manner. NS-1 myeloma cells, kept in log phase in 
RPMI with 11% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three days prior to fusion, were 
centrifuged at 200 g for 5 minutes, and the pellet was washed twice as described for the mouse spleen cells. 
[0045] After washing, the spleen cells and myeloma cells were brought to a final volume of 10 ml in serum free 
RPMI, and 10 ul of that final volume was diluted 1:100 in serum free RPMI. Twenty uJ of each dilution was removed, 
mixed with 20 u.l 0.4% trypan blue stain in 0.85% saline, loaded onto a hemacytometer and counted. Two x 10 8 spleen 
cells were combined with 4 x 10 7 NS-1 cells, centrifuged and the supernatant was aspirated. The cell pellets were dis- 
lodged by tapping the tube and 2 ml of 37°C PEG 1500 (50% in 75 mM Hepes, pH 8.0) (Boehringer Mannheim) was 
added with stirring over the course of 1 minute, followed by adding 14 ml of serum free RPMI over 7 minutes. An addi- 
tional 1 6 ml RPMI was added and the cells were centrifuged at 200 g for 10 minutes. After discarding the supernatant, 
the pellet was resuspended in 200 ml RPMI containing 15% FBS, 100 mM sodium hypoxanthine, 0.4 mM aminopterin, 
16 mM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer Mannheim) and 1.5 x 10 6 thymocytes/ml (plating 
medium). The suspension was dispensed into ten 96-well flat bottom tissue culture plates at 200 ml/well. Cells in plates 
were fed on days 2, 4, and 6 days post-fusion by aspirating approximately 100 ml from each well with an 18 G needle, 
and adding 100 ml/well plating medium described above except containing 10 units/ml IL-6 and lacking thymocytes. 
[0046] Fusions 30 (from a mouse immunized with EC2-4) and 45 (from a mouse immunized with EC1-2) were 
screened initially by antibody capture ELISA, testing for presence of mouse IgG. Secondary screening of fusions 30 
and 45 consisted of assays using plates coated with a monolayer of fixed endothelial cells for ELISAs. HUVEcs, Lewis 
rat brain endothelial cells (LeBCE), and bovine aortic endothelial cells (BAE) were allowed to grow in 96-well flat bottom 
tissue culture microt'iter plates until the bottom of well was completely covered with a monolayer of cells. Plates were 
washed twice with 100 ul/weli of Ca 2+ /Mg 2+ free PBS (CMF-PBS) and aspirated completely. Cells were then fixed with 
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100 p-t/well of 3% p-Formaldehyde, 1% Sucrose in CMF-PBS at room temperature for 30 minutes. Cells were then per- 
meablized with approximately 250 ul/well of CSK buffer (0.5% Triton 100, 100mM NaCl, 10mM PIPES, 2mM MgCI) and 
incubated at room temperature for 30 minutes. Plates were blocked with 250 ul/well of 2% BSA in 1 X CMF-PBS (block- 
ing solution) and incubated at 37°C for 60 minutes. Blocking solution was aspirated and 50 to 1 00 ul/well of supernatant 

5 from fusion plates was added. Plates were incubated at room temperature for 60 minutes and then were washed one 
time with 250 uJ/well of 0.5% BSA in CMF-PBS (wash solution 1) and two times with 250 uJ/well of CMF-PBS (wash 
solution 2). One hundred fifty uJ of horseradish peroxidase conjugated goat anti-mouse IgG(fc) (Jackson ImmunoRe- 
search, West Grove, PA) diluted 1 :3500 in PBST was added and plates were incubated at room temperature for 60 min- 
utes. Plates were washed as before and 150 u.l substrate consisting of 1mg/ml o-phenylene diamine (Sigma) and 0.1 

10 ml/ml 30% H 2 0 2 in 1 0OmM Citrate, pH 4.5 was added. The color reaction was stopped after 30 minutes with the addi- 
tion of 50 }il of 1 5% H 2 S0 4 . A 490 was read on a plate reader (Dynatech). About 20 positive wells were identified for each 
fusion and were subsequently cloned. 

[0047] Hybridomas were screened in cloning steps in an ELISA assay by testing for reactivity of monoclonals to the 
cadherin-5 EC2-4 fusion protein and excluding maltose binding protein reactive monoclonals. Immulon 4 plates 

is (Dynatech, Cambridge, MA) were coated at 4°C with 50 jil/well fusion protein diluted to 0.1 ug/well (for fusion protein) 
and to 0.2 fig/well (for maltose binding protein alone) in 50mM carbonate buffer, pH 9.6. Plates were washed 3 times 
with PBS, 0.05% Tween 20 (PBST) and 50 ul hybridoma culture supernatant was added. After incubation at 37°C for 
30 minutes, and washing as above, 50 uJ of horseradish peroxidase conjugated goat anti-mouse IgG(fc) (Jackson 
ImmunoReseach, West Grove, PA) diluted 1 :3500 in PBST was added. Plates were incubated at 37°C for 30 minutes 

20 and washed 4 times with PBST One hundred ul substrate consisting of 1 mg/ml o-phenylene diamine (Sigma Chemical 
Co., St. Louis, MO) and 0.1 jxt 30% H 2 0 2 in 100 mM citrate, pH 4.5 was added. The color reaction was stopped after 5 
minutes with the addition of 50 uJ of 15% H 2 S0 4 . Absorbance at 490 nm was determined using a plate reader. 
[0048] The hybridomas designated 30Q8A (ATCC HB11316), 30Q4H (ATCC HB11317), 45A5G (HB11318), 
30S2F (HB1 1319), 45C6A (HB1 1320), 30T1 1 G (ATCC HB1 1324), 30M8G, 30O6E and 30R1 A] were identified as reac- 

25 tive with endothelial cells and with the cadherin-5 EC2-4 fusion protein. The hybridomas were cloned twice by limiting 
dilution and grown in ascites. The monoclonal antibodies produced by the hybridomas were isotyped in an ELISA 
assay. The results of the assay are presented in Table 2 below. 

C. Subdomain Specificity of C5 Specific Monoclonal Antibodies 

30 

[0049] To determine if the hybridomas produced monoclonal antibodies reactive with unique epitopes of the extra- 
cellular domain of C5, the monoclonal antibodies were purified, biotinylaled, and tested in a cross competition ELISA. 
Immulon IV 96-well plates were coated with either EC 1-2 or EC2-4 cadherin-5 fusion protein at 0.2 ng/ml in 50 ul 50mM 
NaC0 3 , pH 9.6 overnight at 4°C. The wells were aspirated and washed three times with PBS/0.05% Tween 20. The 

35 plate was then blocked with 50 u.l/we)l PBS, 2% BSA (Sigma) for 30 minutes at 37°C. Monoclonal antibodies were puri- 
fied from hybridoma supernatants over a protein A-Sepharose column and the eluted antibody was dialyzed against 
0.1 M NaC0 3 pH 8.2. One mg/ml of antibody was reacted with 60 uJ of a 1 mg/ml stock solution in DMSO of NHS-biotin 
(Pierce Chemical Co., Rockford, IL) for 1 hour at room temperature and the reaction was stopped by dialysis overnight 
at 4°C against CMF/PBS. The biotinylated antibodies in PBS/0.05% Tween 20 were then added as primary antibody 

40 (50 ul/well) to a plate coated with fusion protein and incubated for 30 minutes at 37°C. The plate was then aspirated and 
washed three times with PBS/0.05% Tween 20. Peroxidase-conjugated strepavidin in PBS/Tween was added 50 unwell 
and incubated for 30 minutes at 37°C. The plate was aspirated and washed three times in PBS/0.05% Tween 20, and 
o-phenylenediamine in 1 0OmM citrate buffer and hydrogen peroxide was added at 100 u.l/well. The plate was developed 
at room temperature for 5-15 minutes. The reaction was stopped with 50 ul/well 15% sulfuric acid and the plate was 

45 read on a plate reader. Results of the assay are presented in Table 2 below. 

[0050] To confirm subdomain specificity, the cadherin-5 fusion proteins EC1 -2 and EC2-4 were run on SDS-PAGE 
(10%) and immunoblotted with the cadherin-5 specific monoclonal antibodies. 

[0051] Table 2 below set outs the domain specificity and isotype of the cadherin-5 specific monoclonal antibodies. 

50 

Table 2 



Monoclonal Antibody 


C5 Subdomain 


Isotype 


30Q4H 


2 


igG 2b 


45A5G 


2 




45C6A 


2 


igGi 
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Table 2 (continued) 



Monoclonal Antibody 


C5 Subdomain 


Isotype 


30S2F 


3-4 


igGi 


30Q8A 


3-4 




30T11G 


3-4 


ig6i 



[0052] Competition assays were carried out as described above for assays for binding to cadherin-5 EC2-4 fusion 
ro protein except that unlabelled primary cadherin-5 specific monoclonal antibodies (or mouse IgG) were added 30 min- 
utes prior to addition of biotinylated cadherin-5 specific monoclonal antibodies. Monoclonal antibodies produced by the 
hybridomas 30M8G, 30O6E and 30RIA compete for a site that is near or identical to the binding site of the antibody 
produced by hybridoma 30Q4H. 

15 Example 5 

[0053] Human cadherins-4 and -5 and rat cadherin -8 were expressed in mouse fibroblast L cells (ATCC CCL1 .3) 
which do not normally express cadherins. 

20 A. Construction of Expression Vectors 

[0054] The cDNA sequences encoding human cadherins-4 and -5 which are described in Example 2 and the cDNA 
sequence encoding rat cadherin-8 which is described in Example 1 were subcloned into the multicloning site of expres- 
sion vector pRC/RSV (Invitrogen). 

25 [0055] Cadherin-4 DNA sequences were isolated by an in vivo excision procedure from the AZapll clone (described 
in Example 2) containing the entire coding sequence of cadherin-4. Using a helper virus, the sequences were excised 
from XZapll in the form of Bluescript plasmid. The plasmid was then cut with Hind\\\ and blunt-ended with T4 polymer- 
ase. The resulting DNA fragment was redigested with Spel to generate a cadherin-4 cDNA fragment having a blunt end 
and a Spel sticky end. The fragment was purified by agarose gel electrophoresis and subcloned into the pRC/RSV 

30 expression vector that had been previously digested with Spel and Xba\ (the Xba\ end was blunt-ended with T4 
polymerase). 

[0056] The Xgt1 1 clone containing the entire coding sequence of cadherin-5 (described in Example 2) was cut with 
EcoRI and the resulting fragment containing the cadherin-5 sequences was purified by agarose gel electrophoresis. 
The purified fragment was then subcloned into the EcoRI site of the Bluescript plasmid. Cadherin-5 sequences were 
35 cut from the resulting construct with HincH and Xba\ and subcloned into the Not\-Xba\ site of the pRC/RSV vector. 
[0057] The full length cDNA encoding rat cadherin-8 was excised from the Uni-ZAP clone described in Example 1 
by digestion with Kpn\, followed by blunt-ending and re-digestion with Spel. The cadherin-8 encoding fragment was 
purified by agarose gel electrophoresis and was subcloned into the pRC/RSV vector which had been digested with 
Xba\, blunt-ended and redigested with Spel. 

40 

B. Transfection of L Cells 

[0058] Mouse fibroblast L cells were transfected with the human cadherin-4 and -5 and rat cadherin-8 expression 
constructs by a Ca 2+ phosphate precipitation method and stable transfectants were obtained by G41 8 selection. Cad- 
45 herin-4 and -8 transfectant cells showed a morphology similar to that of parental L cells (fibroblastic), but cadherin-5 
transfectant cells exhibited a flattened morphology. Neuro 2a cells (ATCC CCL131) were also transfected by a Ca 2+ 
phosphate precipitation procedure with the cadherin-4 and cadherin-8 expression constructs. Cadherin-4 transfectants 
showed epithelial structure, suggesting that cadherin-4 has activity in epithelial structure formation and may be involved 
in the neural tissue development. 

50 

C. Northern and Western Blot Assays of Cadherin mRNA 
and Protein Expression in Transfected Cells 

[0059] Both cadherin-4, -5 and -8 transfectants showed mRNA of the expected size of 3.5 kb, 3.2 kb and 3 kb, 
55 respectively, in Northern blot analysis using the appropriate full length human cDNAs as a probe. (See Example 6A for 
a description of the Northern blot assay.) 

[0060] For Western blots, cadherin-4, -5 and -8 transfectants were washed with PBS and SDS-PAGE sample buffer 
was added directly to the cells. SDS-PAGE (Laemmli) was carried out and and gels were blotted electrophoretically 
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onto PVDF membrane. The membranes were incubated in TBS containing 5% skim milk for 2 hours at room tempera- 
ture and then were incubated with the appropriate polyclonal antibody in TBS containing 0.05% Tween 20 for 1 hour at 
room temperature. After four washes (of 5 minutes each) with TBS containing 0.05% Tween 20, the membranes were 
incubated with alkaline phosphatase conjugated anti-rabbit IgG antibody (Promega Corp., Madison, Wl) in TBS con- 
5 taining 0.05% Tween 20 for 1 hour at room temperature. The membranes were then washed again four times with TBS 
containing 0.05% Tween 20 at room temperature and developed by using Promega Western blue. Cadherin-4, -5 and 
-8 polyclonal antibodies each reacted with a band of about 1 30 KD. 

D. Calcium Protection from Trypsin Digestion 

10 

[0061] Since cadherins have been shown to be protected from trypsin digestion by Ca 2+ , the effect of Ca 2+ on 
trypsin treatment (0.01 % soybean trypsin for 30 minutes at 37°C) of human cadherin-4 and -5 and rat cadherin-8 
expressed on the surface of transfected L cells was examined. Two mM Ca 2+ protected the cadherin-4 from the trypsin 
digestion, but cadherin-5 and cadherin-8 were digested easily even in the presence of 1-5 mM of Ca 2+ . 

15 

E. Cell-Cell Adhesion Assay 

[0082] The cell-cell adhesion activity of the transfected cells was assayed by a re-aggregation assay as described 
in Yoshida-Noro et a!., Devel. Biol., 101, 19-27 (1984). Briefly, transfectants were grown to near confluency and then 

20 dispersed into single cells with mild trypsin treatment (0.01 % for 1 5 minutes) in the presence of 2mM Ca 2+ . After wash- 
ing, the trypsinized cells were incubated in Hepes buffered saline (HBS) containing 2mM CaCI 2 , 1% BSA and 20 u.g/ml 
deoxynuclease on a rotary shaker at 50 rpm for 30 to 60 minutes and then cell aggregation was monitored. Cadherin- 
4 transfectant cells aggregated within 30 minutes and formed relatively large aggregates, whereas cadherin-5 transfect- 
ant cells did not aggregate under the same conditions. However, cadherin-5 transfectants gradually re-aggregated and 

25 formed relatively small aggregate after prolonged incubation (4-5 hours or more). Similarly, cadherin-8 transfectants did 
not show significant cell adhesion activity. Parental L cells did not show cell adhesion under the same conditions. The 
sensitivity of cadherin-5 and cadherin-8 to trypsin digestion may account for the reduced cell adhesion seen in the reag- 
gregation assay because the transfected L cells are initially dispersed with trypsin in the assay. 

30 Example 6 

[0063] The expression of mRNAs encoding cadherins of the invention was examined in rat brain, kidney, liver, lung 
and skin and in various human cells by Northern blot analysis. The expression of cadherin protein was also examined 
in endothelial cells and leukocytes by immunofluorescence or immunoblotting. 

35 

A. Northern Blot Assays of Rat Tissue and Human Cells 

[0064] Poly(A) + RNA from rat brain, kidney, liver, lung and skin was prepared as described in Example 1 for rat 
brain. The RNA preparations were then electrophoresed in an 0.8% agarose gel under denaturing conditions and trans- 

40 ferred onto a nitrocellulose filter. Northern blot analyses were carried out according to a method described in Thomas, 
Proc. Natl. Acad. Sci. USA, 77, 5201-5202 (1980). Filters were hybridized with rat cadherin PCR fragments (described 
in Example 1) labeled with 32 P, including fragments corresponding to cadherins-4 through -11. The final hybridization 
wash was in 0.2X standard saline citrate containing 0.1% sodium dodecyl sulfate at 65°C for 10 minutes. 
[0065] Cadherin-4 and cadherin-8 through -10 mRNAs were detected only in rat brain. The cadherin-8 PCR frag- 

45 ment hybridized to a major band of about 3.5 kb and a minor band of about 4.5 kb in rat brain. The mRNAs detected 
may be alternative splicing products and may correspond to the truncated and full length cadherin-8 clones described 
in Example 1 . Cadherin-6 and -7 probes gave weak, signals on rat brain mRNA even after prolonged exposure. Cad- 
herins-5, -6 and -1 1 mRNAs were detected in rat brain and other rat tissues including cadherin-5 mRNA in lung and 
kidney, cadherin-6 mRNA in kidney, and cadherin-1 1 mRNA in liver. 

so [0066] The expression of cadherin-8 and -11 in cultured human SK-N-SH neuroblastoma cells (ATCC HTB11), 
U251 MG glioma cells and Y79 retinoblastoma cells (ATCC HTB1 8) was also assayed by Northern blot. Human cDNAs 
encoding cadherins-8 and -1 1 (described in Example 2) were labelled with ^P and used as probes of poly(A) + RNA 
prepared from the cells using an Invitrogen FastTrack kit. 

[0067] The Northern blot procedure detected cadherin-8 RNA in the neuroblastoma and retinoblastoma cell lines, 
55 while cadherin-1 1 RNA was detected only in neuroblastoma cells. These results indicate that at least some of the cad- 
herins of the invention are expressed in neurons and glial cells and/or their precursor cells. 

[0066] Cadherin-5 RNA was detected by Northern blot assay of HUVECs (Clonetics), but was not detected in A431 
human epidermoid carcinoma cells (ATCC CRL1555) or IMR90 human fibroblast cells (ATCC CCL186). 
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B. Immunoflourascence of Endothelial Ce lls and Immunoblottinq of Leukocytes 

[0069] Cultured endothelial cells isolated from bovine aorta, bovine brain microvasculature and human umbilical 
vein were subjected to immunofluorescence microscopy using anti-C5 polyclonal antibodies. Cadherin-5 protein at the 
cell junctions which was in close association With the peripheral actin microfilaments was labelled. 
r007Ol In contrast, when freshly isolated leukocytes (human PMN, lymphocytes and monocytes) or the monocyte- 
like cell line U937 were analyzed for the expression of cadherin-5 by immunoblotting using polyclonal antibodies and a 
monoclonal antibody (30O6E) to cadherin-5, no cadherin-5 was detected. Furthermore, using a pan-cadhenn antibody 
[Geiger ef al., J. Cell Science, 97: 607-614 (1 990)] specific for the cytoplasmic tail, no other cadherins were detected in 
these cell populations. 

Example 7 

[0071] Three in vitro transendothelial migration assays were utilized to show that cadherin-5 may participate in the 
movement of leukocytes across the intercellular junctions of endothelium. 

A. Transmigration Assays 

r00721 The migration of leukocytes (either human polymorphonuclear neutrophils or rat T cells) was followed for 
specific periods of time (15 minutes for PMNs and 2 hours forT cells). Immunofluorescent labeling of leukocytes using 
antibodies to specific cellular markers was used distinguish between leukocytes and endothelium. The polyclonal anti- 
bodies described in Example 4 were used to measure changes in the distribution of cadherin-5. An antibody (Novocas- 
tra Laboratories Ltd., United Kingdom) to PE-CAM1 (CD31) which is an intercellular junction molecule in endothelium 
was used as a control. 

r00731 The role of cadherin-5 in the transmigration of polymorphonuclear neutrophils (PMNs) across HUVEcs was 
analyzed The system utilized, which is described in Furie ef a/., J. Immunol, 143: 3309-3317 (1989), has been charac- 
terized with regard to electrical resistance of the endothelium and the adhesion molecules used in transmigration. 
HUVEcs were isolated in the absence of growth factor and cultured on human amniotic connechve tissue in a two- 
chamber system. PMN migration on IL1p-treated HUVEcs has previously been shown to involve E-select.n and p 2 
integrins (CD11/CD18). See Furie et al. J. Immunol., 148: 2395-2484 (1992). nmtronteH 
r00741 In the first assay, transmigration of PMNs was followed as an 1 1 minute time course on HUVEcs pretreated 
for four hours with IL1P (1.5 U/ml) (Collaborative Research Inc., Beford, MA). Prior to addition of neutrophils, antibodies 
to cadherin-5 heavily labelled the cell junctions of the HUVEcs in a continuous pattern. Pretreatment of the endotnelia 
monolayer with IL1p had no effect on the distribution of cadherin-5 in the HUVEc monolayer compared to a contro 
untreated culture. In the second assay, chemotaxis of PMNs across HUVEcs was stimulated by leukotriene B 4 (LTB 4 ) 
(Sigma) which was placed in the bottom chamber at 1 0" 7 M while neutrophils were added to the upper chamber. Chem- 
otaxis of PMNs to LTB 4 across the endothelial monolayer was previously shown to be blocked by antibod.es to CD1 1a, 
CD11b and ICAM-1. [See Furie ef al., Blood, 78: 2089-2097 (1991)] In both assays, PMNs were identrfied with ant.- 
CD45 antibody (Becton Dickinson, San Jose, CA). 

[0075] In both assays during the 1 1-minute time course, the majority of the PMNs that adhered also transmigrated. 
Addition of neutrophils caused a rapid redistribution and regional loss of cadherin-5 even at the earliest tone point (3 
minutes) CD31 was also lost at sites of disruption of the monolayer, but in general appeared to be more stable dunng 
the transmigration process. The loss of cadherin-5 is probably the result of proteases released from the neutrophils dur- 

ing transmigration^ ^ activa ted rat T cells were utilized instead of PMNs (for a two-hour time course). 

Rat brain microvascular endothelium was grown on Transwell 5 micron polycarbonate membranes (Costar, Cambridge, 
MA) T cells were identified using an anti-CD4 antibody (Serotec, Indianapolis, IN). In this assay, the loss of cadherin- 
5 immunolabeling did not occur during transendothelial migration even though 10% of the T cells had crossed the 
endothelium after two hours. These results demonstrate differential effects of PMN versus T cells on intercellular junc- 
tions during tranendothelial migration. Analysis by confocal microscopy suggests that CD4 antigen-activated T cells and 
PMNs have a ligand that is able to interact with cadherin-5 on the endothelium during transmigration. Photomicrographs 
from confocal analysis show that during leukocyte transendothelial migration leukocytes can be found spanning the 
intercellular junction. The leukocyte separates the cell junction and cadherin-5 remains on adjacent cells even though 
the endothelial cells are not in contact. 

R Adhesion of PMNs and T Cells to Cadherin-5 

[0077] To quantitate the binding of PMNs and activated T-cells to cadherin-5, a cell-substrate adhesion assay was 
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developed. This assay utilized plate-bound fusion proteins containing various extracellular subdomains of cadherin-5 
(EC1-2 or EC2-4, see Example 4) and measured the binding of dye-labelled leukocytes to cadherin-5 protein using a 
cytofluor2300 (Millipore, Bedford, MA). 

[0078] The purified fusion proteins were absorbed to styrene plates and the binding of dye-labeled leukocytes to 
5 the fusion proteins was compared to binding to maltose binding protein and heat denatured bovine serum albumin 
(BSA) which was used to block nonspecific binding. The fusion proteins were dissolved in PBS containing Ca 2+ and 
Mg 2 *, diluted into coating buffer and incubated overnight at 4°C. The plates were blocked with heat denatured BSA and 
then incubated with calcien (Molecular Probes, Eugene, OR)-labelled cells for 1 hour at 37°C. Results of the assay are 
presented in FIGURE 1 wherein the relative fluorescence values reported are the mean value of three samples. 
w [0079] PMNs bound to fusion proteins comprising the EC2-4 of cadherin-5, but preferentially bound to fusion pro- 
teins comprisng EC1-2. These results are consistent with presence of cadherin subdomain 2 sequences in both fusion 
proteins. CD4 antigen activated T cells bound EC2-4 fusion protein. All these results, which indicate that PMNs interact 
with a more terminal or exposed subdomain of cadherin-5, are consistent with the rate that these cell types cross the 
endothelium, PMNs transmigrate in a few minutes and T cells require 30-60 minutes. The binding of U937 cells could 
15 be blocked in a dose dependent manner by polyclonal antisera made to the cadherin-5 EC2-4 subdomains. 

[0080] The results presented in the foregoing paragraph in combination with the results presented in Example 6B 
that leukocytes do not express cadherins suggests that the counter ligand to which cadherin-5 binds on leukocytes is a 
distantly related cadherin or is not a cadherin. Cadherin binding has previously been thought to be homotypic. 

20 Example 8 

[0081 ] Expression of cadherin-5 in the blood-brain barrier in the endothelium of the cerebral cortex was assayed by 
Western blot and immunocytochemistry. 

[0082] A SDS lysate was prepared by boiling bovine or macaque capillaries in SDS sample buffer for 2 minutes and 
25 then drawing the extract through a 25 G syringe needle. The extract was centrif uged in a microf uge for 1 5 minutes at 
4°C. Protein concentration in the supernatant was determined by the BCA method (Pierce) using bovine serum albumin 
as a standard. Samples of the supernatent (75uxj) were separated by SDS- PAGE (Laemmli) and electrophoretically 
transferred to nitrocellulose. The nitrocellulose was blocked with 5% milk and 1 0% FBS in Tris-buffered saline, pH 8.0, 
containing 0.05% Tween 20. Cadherin-5 specific monoclonal antibodies (30Q4H and 45C6A) were added. After wash- 
30 ing to remove unbound antibody, the filters were incubated with alkaline phosphatase-conjugated anti-mouse IgG 
(Promega, Madison, Wl). Reactive bands were visualized by addition of NBT/BCIP (Sigma, St. Louis, MO). Expression 
of cadherin-5 was detected in the freshly isolated bovine and macaque capillaries. 

[0083] The Western blot results were confirmed by immunocytochemistry using the cadherin-5 antibodies 30Q4H 
and 45C6A. Macaque cerebral cortex was incubated in 15% sucrose in PBS for 30 minutes at 4°C and embedded in 

35 OCT compound (Tissue-Tek, Elkhart, IN) in cryomolds and quickly frozen. Six micron sections were cut and placed on 
glass slides. The slides were washed with PBS and fixed in 3% p-formaldehyde for 5 minutes. To permeabilize the tis- 
sue sections the slides were immersed in -20°C acetone for 10 minutes and air dried. The sections were blocked with 
2% goat serum and 1 % BSA in PBS for 30 minutes and then incubated with the primary antisera for 1 hour at room 
temperature. The sections were rinsed 3 times in PBS containing 0.1% BSA and incubated with biotinylated anti-rabbit 

40 or anti-mouse IgG (Vector Laboratories, Burlingame, CA) in 1 % BSA in PBS for 30 minutes. After rinsing 3 times, 
strepavidin-conjugated with horseradish peroxidase (Vector Laboratories) was added for 30 minutes and washed 3 
times. Immunolabeling was detected by reaction with diaminobenzoic acid in the presence of NiCI 2 . The monoclonal 
antibody 45C6A only appeared to label larger vessels and the monoclonal antibody 30Q4H labeled both large and 
microvessels. The cell junctions of cerebral capillaries were labelled with the anti-cadherin-5 antibodies in a localized 

45 site. 

[0084] These results and the results presented in Example 7 suggest cadherin-5 is involved in maintenance of the 
blood-brain barrier and that cadherin-5 peptides or cadherin-5 specific monoclonal antibodies may be able to open the 
blood-brain barrier. 

so Example 9 

[0085] Patent Cooperation Treaty (PCT) International Publication No. WO 91/04745 discusses fragments of cell 
adhesion molecules and antibodies to cell adhesion molecules which are purported to disrupt-microvascular and 
endothelial cell tight junctions. 

55 [0086] Three cadherin-5 peptides corresponding to the cell binding domain [HAV region, Blaschuk et a!., Devei. 
Bioi, 139: 227-229 (1990)], the calcium binding region A1 and the calcium binding region B1 of E-cadherin [Ringwald 
era/., EMBOJ., 6: 3647-3653 (1987)] were tested for the ability to affect the permeability of brain endothelium. The pep- 
tides utilized had the following sequences: 
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Peptide 1 (Amino acids 1 14 to 128 of SEQ ID NO: 50) 

LTAVIVDKDTGENLE, 
Peptide 2 (Amino acids 132 to 145 of SEQ ID NO: 50) 

SFTIKVHDVNDNWP, and 
Peptide 3 (Amino acids 1 68 to 1 78 of SEQ ID NO: 50) 

SVTAVDADDPT, respectively. 

[0087] Permeability was measured using a two-chamber culture system (Costar). Rat brain microvascular endothe- 
lium was grown on 12 mm Transwell filters with 3 micron pores (Costar) in the culture system. When the monolayers 
were confluent, two weeks after plating, 3 H-inulin (201 mCi/g) (New England Nuclear, Boston, MA) was added to the 
upper chamber. Cadherin-5 peptide at 100 ng/ml was added to both the upper and lower chambers. Radioactivity 
appearing in the bottom chamber was measured at 1 5 minute intervals over a two hour time course carried out at 37°C 
and was compared to the radioactivity appearing in the bottom chamber of cultures where no peptide was added or 
where no endothelial cells were present. 

[0088] Both peptides 1 and 3 increased endothelium permeability in comparison to control cultures. The increase 
in permeability obtained with peptide 3 was 2.5-fold and the increase with peptide 1 was 1 .5-fold over the controls. Pep- 
tide 2 had no effect on permeability. 

Example 10 

[0089] The functional properties of cadherins involve not only specific intercellular interactions, but also involve 
intracellular interactions with the cytoskeleton. Immunoprecipitation experiments utilizing the cadherin-5-specific rabbit 
polyclonal antibodies and the monoclonal antibody 30Q8A (see Example 4) were performed to determine with which 
proteins cadherin-5 interacts on an intracellular level. 

[0090] Endothelial cells were metabolically labeled overnight with 50 nCi/ml of [ 35 S]-methionine and were then 
extracted with 0.5% Triton X-100 in 10mM HEPES pH 7.4, 0.15M NaCI, 2mM EDTA, 2mM EGTA, 1mM phenanthroline 
and protease inhibitors. The inhibitors included 1mM PMSF, 10 u.g/ml aprotinin, leupeptin, pepstatin A, antipain, soy- 
bean trypsin inhibitor, 100 uo/ml chymostatin and TPCK, 40 u.g/ml of TPCK and bestatin, 50 ug/ml of benzamidine, 
1 mM o-vanidate and 20mM NaF. After 20 minutes on ice, the cells were scraped and centrifuged in a microfuge for 30 
minutes at 4°C. The supernatant was precleared and either polyclonal anti-cadherin-5 or normal rabbit serum was 
added and incubated overnight at 4°C. Protein A-sepharose (Pharmacia, Piscataway, NJ) was added for 2 hours at 4°C 
and centrifuged. A first low stringency wash with 1 0mM HEPES pH 7.4, 0.15M NaCI, 2mM EDTA and 2mM EGTA con- 
taining 1 % Triton X- 1 00, 0.5%. DOC and 0.2% SDS was performed. A second high stringency wash was performed with 
the same buffer containing 2% SDS. A final wash was then performed with Tris-buffered saline, and the samples were 
boiled and analyzed on SDS/PAGE (7%). Three bands with molecular weights of 104 KD, 95 KD, and 82 KD were iden- 
tified as associated with cadherin-5. 

[0091] Three intracellular proteins, termed catenins, have previously been identified by their ability to bind to the 
cytoplasmic domain of E-cadherin. These proteins have been designated a, (3, and 7 catenins and have molecular 
weights of 102 KD, 88 KD and 80 KD, respectively [Ozawa ef a/., EMBO J. 8: 1711-1717 (1989)]. The association of 
catenins with E-cadherin seem to be required for E-cadherin function because deletion of the cytoplasmic domain of E- 
cadherin results in loss of cell adhesion function and catenin binding. The molecular cloning of cc-catenin has shown it 
to be a vinculin-like protein [Nagaf uki ef al., Cell, 65: 849-857 (1 991 ); Herrenkenecht et a/., Proc. Natl. Acad. Sci. USA, 
88: 9156-9160 (1991)]. The amino acid sequence of the Xenopus 0-catenin [McCrea etal., Science, 254: 1359-1361 
(1991)1 exhibits 63% similarity to the human protein plakoglobin [Franke etal., Proc. Natl. Acad. Sci. USA, 86: 4027- 
4031 (1989)]. Plakoglobin has been localized to both the cytoplasmic region of desmosome and adherens junctions in 
epithelial cells. The desmonsomal component desmoglein I interacts with plakoglobin and is a member of the cadherin 
superfamily [Koch et a/., Eur. J. Cell. Biol., 53: 1-12(1 990)]. Plakoglobin has a molecular weight of 82 KD and may be 
the^catenin [Peifer etal., J. Cell Biol., 118: 681-691 (1992)]. Even though endothelial cells lack desmosome, they have 
been shown to contain plakoglobin-associated with intercellular junctions [Franke etal., Biol, of the Cell, 59: 205-218 
(1 987)]. Other cytoskeletal elements associated with cadherins are ankyrin and fodrin [Nelson et ai, J. Cell Biol., 1 10: 
349-357(1990)]. 

[0092] To identify whether plakoglobin was one of the proteins complexed to cadherin-5, an unlabeled lysate of 
bovine aortic endothelial cells was made and immunoprecipitation was carried out as described above using anti-cad- 
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herin-5 antibody. The unlabelled immunoprecipitates were separated by SDS/PAGE and then electrophoretically trans- 
ferred to nitrocellulose. The membrane was blocked with 5% milk in Tris-buffered saline, pH 8.0, containing 0.05% 
Tween 20 (TBST) and then was incubated with the murine monoclonal antibody PG5.1 (IBl Research Products, Cam- 
bridge, MA) to plakoglobin in blocking solution (1 :20) for 1 hour at room temperature. The membrane was washed with 
5 TBST and then incubated with goat anti-mouse IgG conjugated to alkaline phosphatase. An 82 KD protein was identi- 
fied using NBT/BCIP under both low and high stringency wash conditions. These results demonstrate that plakoglobin 
is tightly associated with the cytoplasmic domain of cadherin-5 in endothelium. Immunofluorescence studies of regen- 
erated endothelium show that cadherin-5 and plakoglobin are localized to the cell junctions and are coordinately regu- 
lated. 

w [0093] The interation of cadherin-5 with plakoglobin may be a target for modulation of cadherin-5 activity. 

[0094] While the present invention has been described in terms of preferred embodiments, it is understood that var- 
iations and improvements will occur to those skilled in the art. Thus, only such limitations as appear in the appended 
claims should be placed on the scope of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Suxuki, Shintaro 
<it) TITLE OF INVENTIONS CADHERIN MATERIALS AND METHODS 
(iii) NUMBER OF SEQUENCES: 62 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE t Marshall, O'Toole, Gersteln, Murray £ 

Borun 

(B) STREET: 6300 Sears Tower, 233 S. Wacker Drive 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: USA 

(F) ZIP: 60606 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE i Patentln Release /1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/872,643 

(B) FILING DATE: 17 APR 1992 

(viii) ATTORNEY /AGENT INFORMATION: 
<A) NAME: Noland, Greta E. 

(B) REGISTRATION NUMBER: 35,302 

(C) REFERENCE/DOCKET NUMBER: 31340 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (312) 474-6300 
.(B) TELEFAX: (312) 474-0448 
(C) TELEX: 25-3856 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino adds 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Thr Ala Pro Pro Tyr Asp 

1 5 

(2) INFORMATION . FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
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<it) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:2x 
GAATTCACNG CNCCNCCNTA VGA 
(2) INFORMATION FOR SEQ ID NO:3t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Phe Lye Lye Leu Ala Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4i 
GAATTCTCNG CNARYTTYTT RAA 
(2) INFORMATION FOR SEQ ID NO:5r 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Mod if led-site 

(B) LOCATIONS 2 

(D) OTHER INFORMATION: /note- "The amino acid at this 
position is a proline or a glycine." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note* "The amino acid at this 

position is a leucine, an isoleucine or a valine." 

(ix) FEATURE: 

(A) NAME/KEY x Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION t /note- "The amino acid at this 
position is a phenylalanine or a tyrosine." 



17 



EP1 074 618 A2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Lye Xaa Xaa Asp Xaa Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 23 Uie pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
6AATTCAARS SNNTNGAYTW VGA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(ix) FEATURES 

(A) FAKE/KEY: Modif ied-site 

(B) LOCATIONS 1 " ^ 
(D) OTHER INFORMATION: /note- "The amino acid at this 

position is an esparagine or an a apart ic acid*" 

(ix) FEATURE: 

<A) NAME /KEY s Modif ied-aite 
(B) LOCATIONS 3 
r (D) OTHER INFORMATION: /note* "The amino acid at this 
position is an alanine or a proline. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Xaa Glu Xaa Pro Xaa Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPEs nucleic acid 

(C) STRAHDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAATTCRAAN NNNGCNGSYT CRT 
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10 



15 



(2) INFORMATION FOR SEQ ID NOt9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED HESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 
TCCCTGCTGO TCTTCGACTA CCAAGGCAGC CGTTCTACTC CAGGCTCTGT CAGCTCCCTG 60 
AACTCCTCCA CCTCCGGGGA TCAAGATTAC GACTACTTGA ATGACTGGGG GCCCCGG 117 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 10: 

Ser Leu Leu Val Phe Asp Tyr Glu Gly Ser Gly Ser Thr Ala Gly Ser 
1 5 10 15 

Val Ser Ser Leu Asn Ser Ser Ser Ser Gly Asp Gin Aep Tyr Asp Tyr 

20 25 30 

Leu Aan Aep Trp Gly Pro Arg 

35 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 120 base pairs 

(B) TYPEs nucleic acid 

(C) STRAND ED NESS : single 
(0) TOPOLOGY s linear 

(il) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACACTGCACA TCTACCCCTA CCAGGGCACA CACTCCATCG CAGAGTCCCT CAGCTCCCTG 60 
AGCACCAATT CCTCCGACTC TGACATCGAC TATCACTTC C TCAATGACTG GGGACCCAGG 120 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 40 amino acids 
so (B) TYPEs amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



55 



25 



30 



45 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Thr Leu His lie Tyr Cly Tyr Clu Cly Thr Clu Ser lie Ala Clu Ser 

i 5 io ;s 

Leu Ser Ser Leu Ser Thr Aan Ser Ser Asp Ser Asp He Asp Tyr Asp 
20 2 5 30 

Phe Leu Asn Asp Trp Cly Pro Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCCTTGGCCA CCTATGCCTA CGAAGGAACT GCCTCCGTGG CCGACTCCCT CAGCTCACTA 
GAATCAGTGA CCACACATGG AGACCAAGAT TATGACTATT TGACTG^CTG GGGCCCTCGA 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TTPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Leu Ala Thr Tyr Ala Tyr Glu Gly Thr Gly Ser Val Ala Asp Ser 
1 5 10 15 

Leu Ser Ser Leu Glu Ser Val Thr Thr Asp Cly Asp Gin Kep Tyr Asp 
20 25 30 

Tyr Leu Ser Asp Trp Gly Pro Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
TCCCTTCAGA CTTATCCATT TGAAGGAAAT GGCTCAGTAG CTGAATCTCT CACTTCTTTA 60 
GATTCTAACA GCTCGAACTC TGATCAGAAT TATGACTACC TTAGTGACTG GGGTCCTCTC 120 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TTPE: amino acid 
(D) topology i linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Leu Gin Thr Tyr Ala Phe Glu Gly Asn Gly Ser Val Ala Glu Ser 
IS 10 15 

Leu' Ser Ser Leu Asp Ser Aan Ser Ser Asn Ser Asp Gin Asn Tyr Asp 
20 25 30 

Tyr Leu ser Asp Trp Gly Pro Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 17s 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 120 base pairs 

(B) TYPEs nucleic acid 

(C) STRAND ED NESS x single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 
TCCATTCAGA TTTATGCCTA TGAAGCCOGA GCGTCTCTCC CTGGCTCTCT CAGCTCGTTC 60 
GAGTCCACCA CATCAGACTC AGACCAGAAT TTTGACTACC TCACTGACTC GGGTCCCCGC 120 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 40 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ser lie Glrf Ile^Tyr Gly Tyr Glu Gly Arg Gly Ser Val Ala Gly Ser 
1.5 10 15 

Leu Ser Ser Leu Glu Ser Thr Thr Ser Asp Ser Asp Gin Asn Phe Asp 
20 25 30 
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Tyr Leu Ser Asp Trp Gly Pro Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
TCCTTGGCCA CTTACGCCTA TGAAGGGAAT GATTCTGTAG CCAATTCTCT CAGCTCCTTA 
GAATCTCTCA CAGCTCATTG TACCCAGGAT TATGACTACC TTAGTGACTC GGGGCCACGC 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION I SEQ ID N0:20: 

Ser Leo Ala Thr Tyr Ala Tyr Glu Gly Asn Asp Ser Val Ala Asn Ser 
1 5 10 IS 

Leu Ser Ser Leu Glu Ser Leu Thr Ala Asp Cys Asn Gin Asp Tyr Asp 
20 25 30 

Tyr Leu Ser Asp Trp Gly Pro Arg 
35 40 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY & linear 

(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
TCGCTGCCTA CCTATGCCTA TGAAGGAAAC CACTCTGTTG CTGAATCTCT GAGCTCCTTA 
GAATCAGGTA CCACTGAAGG AGACCAAAAC TACGATTACC TTCGAGAATG CGCGCCTCGG 
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10 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 22: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION t SEQ ID NO:22: 



Ser Leu Ala Thr Tyr Ala Tyr Glu Gly Asn Aep Ser Val Ala Glu Ser 
15 10 15 

is Leu Ser Ser Leu Glu Ser Gly Thr Thr Glu Gly Aep Gin Asn Tyr Asp 

20 25 30 

Tyr Leu Arg Glu Trp Gly Pro Arg 
35 40 

20 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPES cDNA 

(Xi) SEQUENCE DESCRIPTION j SEQ ID NO: 23: 

30 

TCCATCCAAA TCTATGGTTA TGAGGCGAGC GGTTCCGTGG CTCCCTCCCT GAGCTCCTTG 60 
GAGTCTCCCA CCACAGATTC GCACCTCCAC TACCACTATC TACAGAACTG GGGACCTCGG 120 

35 (2) INFORMATION FOR SEQ ID NO: 24: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 40 amino acids 

(B) TYPEs amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPES protein 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ser He Gin He Tyr Gly Tyr Glu Gly Arg Gly Ser Val Ala Gly Ser 
15 10 15 

Leu Ser Ser Leu Glu Ser Ala Thr Thr Asp ser Asp Leu Asp Tyr Asp 
20 25 30 

Tyr Leu Gin Asn Trp Gly Pro Arg 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 25: 
AAGCGGTTTO ATTAOGAGAT CTCTGCCTTT CACACCCTGC TCATCAAACT GGAGAATGAC 
GACCCATTGG TACCCGACGT CTCCTATGGC CCCAGCTCCA CGGCCACTGT CCACATCACG 
CTCTTGGATG TCAACGAGGG ACCAGTCTTC 
(2) INFORMATION FOR SEQ ID NO: 26s 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE i protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Lye Arg Phe Asp Tyr Clu He Ser Ala Phe His Thr Leu Leu lie Lys 
15 10 15 

Val Glu Asn Olu Asp Pro Leu Val Pro Asp Val Ser Tyr Gly Pro Ser 

20 25 30 

Ser Thr Ala Thr Val His He Thr Val Leu Asp Val Asn Clu Gly Pro 
35 40 45 

Val Phe 

50 

(2) INFORMATION FOR SEQ ID NOx27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AAGGGTATGG ATTATGAGCT GAACCGTGCC TCCATGCTGA CCATAATGGT CTCCAACCAG 
GCGCCCCTGG CCAGCGGGAT CGAGATCTCC TTCCAGTCCA CAGTGGGGGT AACCATCTCT 
GTCACCGATC TCAACGAAGC CCCCTACTTC 
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w 



15 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t SO amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Lye Gly Met Asp Tyr Glu Leu Aon Arg Ala Ser Met Leu Thr lie Met 
IS 10 15 

Val Ser Aan Gin Ala Pro Leu Ala Ser Gly He Gin Met Ser Phe Cln 
20 25 30 

Ser Thr Val Gly Val Thr He Ser Val Thr Asp Val Aan Glu Ala Pro 
35 40 45 



Tyr Phe 

20 50 



(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS r single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AAACCACTGG ATTTTGAACT CAXCCAGCAG TACAOGTTCC ACATCGAGGC CACAGACCCC 60 

ACTATCAGAC TCGGATACCT CAGCAGCACT GOGGGCAAAA ACAAAGCCAA GATCATCATC 120 

35 AATGTCCTAG ATCTCCATGA GCCCCCTCTT TTC 153 

(2) INFORMATION FOR SEQ ID NO: 30: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEN GTH t SI amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Arg Leu Asp Phe Glu Leu He Gin Gin Tyr Thr Phe His He Glu 
15 10 15 



Ala Thr Asp Pro Thr He Arg Leu Gly Tyr Leu Ser Ser Thr Ala Gly 
20 25 30 

Lys Asn Lys Ala Lys He He He Asn Val Leu Asp Val Asp Glu Pro 
35 40 45 
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Pro Val Phe 
50 

(2) INFORMATION FOR SEQ ID NO: 31* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base paim 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:31x 
AAGGGTTTGO ATTTTGAAAA GAAGAAACTG TATACCCTTA AAGTGGAAGC CTCCAATCCT 60 
TATCTTGAGC CAOCATTTCT CTACTTGGGG CCTTTCAAAG ATTCAGCCAC CCTTAGAATT 120 
GTGGTGGAGG ATGTAGATGA ACCTCCTGCC TTC 153 
(2) INFORMATION FOR SEQ ID NOi32i 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION x SEQ ID NO: 32: 

Lye Gly Leu Asp Phe Glu Lye Lys Lya Val Tyr Thr Leu Lys Val Clu 
15 10 15 

Ala Ser Aan Pro Tyr Val Glu Pro Arg Phe Leu Tyr Leu Gly Pro Phe 
20 25 30 

Lys Aap Ser Ala Thr Val Arg lie Val Val Clu Asp Val Asp Glu Pro 
35 40 45 

Pro Ala Phe 
50 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH} 153 base pairs 

(B) TYPES nucleic acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

AAGCC T CT CC ACTTTGAGAC CAAAAAATCC TATACTCTGA AGGTGGAGGC AGCCAATATC 60 

CACATCGACC CAC C TTT CA G TGGCAGGGGA CCCTTTAAAG ATACAGCAAC AGTCAAAATT 120 

CTTCTAGAGG ATGCTGATGA GCCTCCGCTC TTC 153 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE i amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



w 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 34: 

Aap Ala Leu Aep Phe Glu Thr Lye Lys Ser Tyr Thr Leu Lys Val Glu 
15 10 15 

Ala Ala Asn lie His lie Asp Pro Arg phe Ser Gly Arg Gly Pro Phe 

15 20 25 30 

Lye Aep Thr Ala Thr Val Lys He Val Val Glu Aep Ala Asp Glu Pro 
35 40 45 

Pro val Phe 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 152 base pairs 

(B) types nucleic acid 
25 (C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEt CDHA 



in 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

AAGGGGGTGG ACTATGAAGC CAAAACAAGT TATACCCTGC GCATAGAAGC TGCAAATCGA 60 

CATGCTGATC CCCGCTTTCT CAGCTTGGCT CCATTCAGTG ACACAACAAC AGTTAAGATA 120 

35 ATTGTGGAAG ACGTGGATGA ACCCCCCTACT C 152 

(2) INFORMATION FOR SEQ ID N0:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
„ (B) TYPE; amino acid 

(D) TOPOLOGY s linear 

(ii) MOLECULE TYPEs protein 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Lys Gly Val Aep Tyr Glu Ala Lys Thr Ser Tyr Thr Leu Arg He Glu 
1 5 10 15 

Ala Ala Asn Arg Asp Ala Asp Pro Arg Phe Leu Ser Leu Gly Pro Phe 

50 20 25 30 

Ser Aep Thr Thr Thr Val Lys He He Val Glu Asp Val Asp Glu Pro 
35 40 45 
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Pro Tyr Ser 
50 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY:, linear 

(ii) MOLECULE TYPE: cDNA 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:37: 
15 AAGCCACTTC ACTATGAGAA CCGAAGACTA TATACACTGA 2WGTGGAGGC AGAAAATACC 60 

CATGTCGATC CACGTTTTTA CTATTTAGGG CCATTCAAAG ATACAACAAT TGTAAAAATC 120 
TCCATAGAAG ACGTGGATGA GCCACCCCCC TTT 153 
(2) INFORMATION FOR SEQ ID NO: 38: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY* linear 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION r SEQ ID NO:38: 

Lye Pro Leu Asp Tyr Glu Asn Arg Arg Leu Tyr Thr Leu Lys Val Glu 

X 5 10 15 

Ala Glu Asn Thr His Val Asp Pro Arg Phe Tyr Tyr Leu Cly Pro Phe 
20 25 30 

Lye Asp Thr Thr He Val Lys He Ser He Glu Asp Val Asp Glu Pro 
35 40 45 

Pro Pro Phe 
50 

(2) INFORMATION FOR SEQ ID NO: 39: 

40 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



30 



35 



45 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

AGGGGTGTCG ATTATGAAAC CAAAAGAGCA TATAGCTTGA AGGTAGAGGC GGCCAATGTA 60 

50 CACATTGATC CCAAGTTCAT CAGCAATGCA CCTTTCAAGC ACACAGTGAC TCTCAAGATT 120 

GCAGTAGAAG ATGCCAATCA CCCCCCTCCC TTC 153 



55 
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w 



20 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Arg Gly Val Asp Tyr Glu Thr Lyo Arg Ala Tyr Ser Leu Lys Val Clu 
15 10 15 

Ala Ala Asn Val His He Asp Pro Lys Phe He Ser Asn Gly Pro Phe 

15 20 25 30 

Lys Asp Thr Val Thr Val Lys He Ala Val Glu Asp Ala Asn Glu Pro 
35 40 45 



Pro Pro Phe 
50 



.(2) INFORMATION FOR SEQ ID NO:41: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3136 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDED NESS * single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE* cDNA 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GGCACGAGCG CAAGCOGGGG AGCGCTCGGC CCACAATTAG TGGATGGATT TGGAATCTCC 60 

CTCOCTCCTC CAAGCTCCGC CACTGCCACT TTAGGCAGAG ACCTGAGCGT CAACAOGCGA 120 

35 GCCGTACTTT TAGGCTCCGG ACACTCAGCC CAGCGOGOCA GCTTCGCATC TCCGCACCAG 180 

GCTCCACAGC TCGGAGAGGC ATGAACGCGA TCCGGAGGAG ACTACCCTGC GCGCGGGGAT 240 

CCCTGGACAT TAGCCCCTCT OGGGAACTGA CCCCCAGCTC CTTCAGCCAT TTATCAATCC 300 

40 AGAGCCTTGA G ATTTTTTT C OGCATCCCGG AGCCCGACCT CAGAAATTTC AATCAAAAGG 360 

AAAGTCAATC GATCGTGGTC TTGGAAAAGC TGCTTAGACA TGTCTGTTTC CCGGCTCTCT 420 

GAACCCGTCC CACAGCTGTA AGTAAGCGCT TCACAGTGOG TGATGAATTO GATCGCTTCG 480 

GACCCGAGGC AAAAAAAATA ATTGTCTCAT TTTCCTCCTG ATTTGCTTAA CTGGTGGGAC S40 

CATCCCACAA. AGCCTAGCTG AGACCCTTTT GGACCTCTGG ACTCCATTAA TAATATTATG 600 

GATTACTCTT CCCTCTTTTG TCTACATGGC TCCGATGAAT CAGGCTCACG TTTTAACTAC 660 

TGGATCCCCT TTGGAACTAA CCAGGCAGAG TGAAGAAATG CGC ATTTTG A ACCGCTCCAA 720 

AACAGGTTGG GTTTGGAATC AA A T C I HU T TCTGCAAGAA TTTTCTGCAC CTGAACCGAT 780 

TCTCGTTCGC CGGTTACACA CAGATCTGGA TCCTGGGAGC AAAAAAATCA ACTATATCCT 840 
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ATCCCCTCAT GCAGCCCGCA CAATCTTTCA AATAAACGAT ATAACTCCAG ACATCCATCC 
TATCAAAAGA CTTCACCCAG ACGAAAACGC TGAGTATACG TTAACAGCTC AGCCAGTGGA 
CTGCCACACA AACAAACCTC TCGAGCCTCC TTCTGAATTT ATTATTAAGC TTCAAGACAT 
CAACGACAAT CCCCCCCAGT TTCTCAATGG ACCTTACCAT GCTACTCTTC CACAGATCTC 
CATCTTGGCT ACATCTGTCA CTAATGTAAC GCCCXCTCAT GCTGACGATC CAGTTTATGC 
AAACACTGCA AACTTGCTTT ACAGTATCTT GGAGCGACAG CCCTATTTTT CCATTGAGCC 
TCAAACAGCT ATTATAAAAA CTCCCCTTCC TAACATGGAC AGAGACGCCA AG6AGGAATA 
CCTGCTTCTA ATTCAAGOCA AAGATATGGG TGCGCATTCC GCTCGTCTCT CTCGAACCAC 
GACACTCACA GTGAOGCTTA CCGATGTGAA TCACAATCCT CCAAAATTTG CTCAAAGTTT 
GTATCACTTC TCAGTACCAG AAGATGTGGT CCTTCGCACT GCAATAGCAA CCGTTAAACC 
CAATGACCAG CATATTGGTC AAAATGCACA ATCTTCCTAT GACATCATTG ATGCAGATGG 
GACAGCACTA TTTGAAATCA CTTCTGATGC CCAGGCACAG GATCCTCTTA TAAGACTAAG 
AAAGCCTCTG CACTTTGAGA CCAAAAAATC CTATACTCTG AAGGTGGAGG CAGCCAATAT 
CCACATCGAC CCACGTTTCA GTGCCACGGG AOCCTTTAAA CATACAGCAA CAGTCAAAAT 
TCTTGTAGAG GATGCTGATG AGCCTCCCGT CTTCTCTTCA CCCACTTACC TCCTTGAAGT 
TCATGAAAAT CCTCCCTTGA ACTCTGTGAT TGGCCAAGTG ACAGCTCOTG ACCCTGATAT 
CACTTCCAGC CCAATAAGGT TTTCCATTGA CCCCCKCACT GACTTCGAGA CACAGTTCAA 
CATGAATGCA GATCATGGGA AGATAACACT GCCGACCCCA CTGGACAGAG AACTAAGTGT 
CTCGCACAAC ATCTCCATCA TTCCTACTCA GATCAGCAAC CACAGTCAGA TATCGCGACT 
GCCTCTTCCT ATTAAAGTGC TGGATGTCAA TGACAACCCC CCTCAATTCG CGTCCCAATA 
TGAGGCATTT TTATGTGAAA ATCGAAAACC CGCCCMICTC ATTCAAACAG TAAGCGCCAT 
GGACAAAGAC GATCCCAAAA ATGGACATTT TTTCTTGTAC AGTCTTCTTC CAGAAATGGT 
CAACAACOCA AATTTCACCA TCAAGAAAAA OGAWJATAAT TCCCTGAGCA TTCTCGCAAA 
ACATAATGGA TTCAACCGCC AGAAGCAAGA AGTCtACCTT CTGCCTATCG TCATCAGTGA 
CAGTGGGAAC CCCCCTCTGA GTAGCACCAG 1AOCCTGACC ATCOGOCTCT CTGGCTGTAG 
CAATCAOGGC GTGCTTCAGT OGTGCAATCT CGAAGCTTAT CTCCTTCCTA TTGGGCTCAG 
TATGGGCGCG TTAATTGCTA TATTAGCCTC CATCATTTTG CTGCTCCTCA TTGTGGTTCT 
GTTCOTTACC CTCAGCCGCC ATAAAAATGA ACCACTAATA ATCAAAGATG ATGAAGACGT 
TOGAGAAAAC ATCATTCGCT ACGACGACGA AGCAGGCCGG CAGGAGGACA CAGACGCTTT 
TGACATTGCA ACTTTGCAAA ACCCAGATGG AATTAATGGA TTTTTAOCCC GTAACGATAT 
TAAACCAGAT TTGCAGTTTA TGCCAAGGCA AGCGCTTGCT CCAGTTCCAA ATGGTCTTGA 
TGTCGATGAA TTTATAAATG TAAGGCTTCA TGAGGCAGAT AATCACCCCA CGGCCCCACC 
ATATGACTCC ATTCAGATTT ATGCCXATCA AGGCCGAGGG TCTGTGGCTG CCTCTCTCAG 
CTOGTTGGAG TCCACCACAT CAGACTCAG A CCAGAATTTT GACTACCTCA GTGACTCCGG 
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TCCCCGCTTT AAG AGACTCG GCGAACTCTA CTCTCTTCGT GAAAGTGACA AAGAAACTTG 2940 

ACACTGGATT ACATAAATAA TCAATGGAAC TGAGCATTCT GTAATATTCT AGGGTCACTC 30O0 

b CCCTTAGATG CAACAAATGT GGCTATTTGT TTTAGAGCCA AGTTTACCAC CAATCATCTA 3060 

TAAACTCAAC CACATTTTAA TGTTGAACCA AAAAAAATAA TAAAAAATAA AAAGTATATC 3120 

TTAGGAGGTG AAAAAA 3136 

"> (2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 799 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 



15 



20 



25 



30 



Met Pro Glu Arg Leu Ala Glu Thr Leu Leu Asp Leu Trp Thr Pro Leu 
1 5 10 15 

lie lie Leu Trp lie Thr Leu Pro Ser Phe Val Tyr Met Ala Pro Met 
20 25 30 

Asn Gin Ala His Val Leu Thr Thr Gly Ser Pro Leu Glu Leu Ser Arg 
35 40 45 

Gin Ser Glu Glu Met Arg lie Leu Asn Arg Ser Lys Arg Gly Trp Val 
50 55 60 

Trp Asn Gin Met Phe Val Leu Glu Glu Phe Ser Gly Pro Glu Pro He 
65 70 75 80 

Leu Val Gly Arg Leu His Thr Asp Leu Asp Pro Gly Ser Lys Lys He 
85 90 95 

Lys Tyr He Leu Ser Gly Asp Gly Ala Gly Thr He Phe Gin He Asn 
100 105 110 

35 Asp He Thr Gly Asp He His Ala He Lys Arg Leu Asp Arg Glu Glu 

115 120 125 

Lys Ala Glu Tyr Thr Leu Thr Ala Gin Ala Val Asp Trp Glu Thr Asn 
130 135 140 

40 Lys Pro Leu Glu Pro Pro Ser Glu Phe He He Lys Val Gin Asp He 

145 150 155 160 

Asn Asp Asn Ala Pro Glu Phe Leu Asn Gly Pro Tyr Hio Ala Thr Val 
165 170 175 

Pro Glu Met Ser He Leu Gly Thr Ser Val Thr Asn Val Thr Ala Thr 
45 180 165 190 

Asp Ala Asp Asp Pro Val Tyr Gly Asn Ser Ala Lys Leu Val Tyr Ser 
195 200 205 

He Leu Glu Gly Gin Pro Tyr Phe Ser He Glu Pro Glu Thr Ala He 

50 210 * ■ . 1 215 220 

He Lys Thr Ala Leu Pro Asn Met Asp Arg Glu Ala Lys Glu Glu Tyr 
225 230 235 240 
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Leu Val Val He Gin Ala l>y. Aep Met Cly Cly Hie Ser Gly Gly Leu 
245 * 50 

Ser Cly Thr Thr. Thr Leu Thr Val Thr Leu Thr Asp Val Asn Ab P Asn 
260 265 270 

Pro Pro Lys Phe Ala Gin Ser Leu Tyr HI. Phe Ser Val Pro Glu Asp 
275 280 285 

Val Val Leu Cly Thr Ala lie Gly Arg Val Lys Ala Aen Asp Gin Aep 
290 295 300 

lis Cly Glu Asn Ala Gin Ser Ser Tyr Aap lie He Aap Cly Asp Cly 
305 310 315 320 

Thr Ala Leu Phe Clu He Thr Ser Aap Ala Gin Ala Gin Aap Cly Val 

325 330 335 

He Arg Leu Arg Lya Pro Leu Aap Phe Clu Thr Lya Lya Ser Tyr Thr 
340 345 35Q 

Leu Lya Val Clu Ala Ala Aan lie His He Aap Pro Arg Phe Ser Cly 
355 360 365 

Arg Gly Pro Phe Lya Aap Thr Ala Thr Val Lya lie Val Val Glu Aap 
370 375 380 

Ala Aap Glu Pro Pro Val Phe Ser Ser Pro Thr Tyr Leu Leu Glu Val 
385 390 3^5 400 

His Glu Aan Ala Ala Leu Asn Ser Val He Cly Gin Val Thr Ala Arg 
405 410 *15 

Asp Pro Aap He Thr Ser Ser Pro He Arg Phe ser Xle Asp Arg His 
420 425 430 

Thr Asp Leu Glu Arg Gin Phe Aan He Aan Ala Asp Aep Cly Lya He 
435 440 44S 

Thr Leu Ala Bur Pro Leu Asp Arg Clu Leu Ser Val Trp His Aan He 
450 455 460 

Ser He He Ala Thr Clu He Arg Aan Hia Ser Gin He Ser Arg Val 
465 470 475 4BQ 

Pro Val Ala He Lya Val Leu Aap Val Aan Asp Asn Ala Pro Glu Phe 
485 490 495 

Ala Ser Glu Tyr Glu Ala Phe Leu Cys Glu Aan Gly Lya Pro Gly Gin 
500 505 5IO 

Val He Gin Thr Val Ser Ala Met Asp Lys Asp Asp Pro Lys Asn Gly 
515 520 525 

His Phe Phe Leu Tyr Ser Leu Leu Pro Glu Met Val Aan Asn Pro Asn 
530 535 540 

Phe Thr He Lys Lys Asn Clu Aap Aen Ser Leu Ser He Leu Ala Lys 
545 550 555 560 

His Asn Cly Phe Asn Arg Gin Lys Gin Clu Val Tyr Leu Leu Pro He 

565. 570 575 

Val He Ser Aap Ser Cly Asn Pro Pro Leu Ser Ser Thr Ser Thr Leu 
580 585 590 
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Thr lie Arg Val Cys Cly Cys Ser Asn Asp Gly Val Val Gin Ser Cys 
595 600 605 

Asn Val Glu Ala Tyr Val Leu Pro lie Gly Leu Ser Met Gly Ala Leu 
610 615 620 

He Ala He Leu Ala Cys He He Leu Leu Leu Val He Val Val £eu 
625 630 635 640 

Phe Val Thr Leu Arg Arg His Lys Asn Glu Pro Leu He He Lys Asp 
645 650 655 

Asp Glu Asp Val Arg Glu Asn lie lie Arg Tyr Asp Asp Glu Gly Gly 
660 665 670 

Gly Glu Glu Asp Thr Glu Ala Phe Asp He Ala Thr Leu Gin Asn Pro 
675 680 685 

Asp Gly He Asn Gly Phe Leu Pro Arg Lys Asp He Lys Pro Asp Leu 
690 695 700 

Gin Phe Met Pro Arg Gin Gly Leu Ala Pro Val Pro Asn Gly Val Asp 
,705 710 715 720 

Val Asp Glu Phe He Asn Val Arg Leu His Glu Ala Asp Asn Asp Pro 
725 730 735 

Thr Ala Pro Pro Tyr Asp Ser He Gin He Tyr Gly Tyr Glu Gly Arg 
740 745 750 

Gly Ser Val Ala Gly Ser Leu Ser Ser Leu Glu Ser Thr Thr Ser Asp 
755 760 765 

Ser Asp Gin Asn Phe Asp Tyr Leu Ser Asp Trp Gly Pro Arg Phe Lys 
770 775 780 

Arg Leu Cly Glu Leu Tyr Ser Val Gly Glu Ser Asp Lys Glu Thr 
785 790 795 

(2) INFORMATION FOR SEQ ID N0:43: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3043 base pairs 

(B) TYPE* nucleic acid 

(C) STRANDED NESS i single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEr cDNA 



- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GGCACGAGCG GAAGCOGGGG AGCGCTOGGC CGAGAATTAG TGGATCGATT TGGAATCTCC 
CTGCCTCCTC GAAGCTCCGC CACTGCCACT TTAGCCAGAG ACCTGAGCGT CAACACGCGA 
GCCGTACTTT TAGGCTGCGG ACACTGAGCC CAGCGCGCCA CCTTCCCATC TCCGCACCAG 
GCTCCACAGC TCGCAGAGGC ATGAACCCGA TCCGGAGGAG ACTACCCTCC GCGCGCCGAT 
CCGTGGACAT TA(*CCGCTCT CGGGAACTGA CCCCCAGCTC CTTCAGCCAT TTATGAATCC 
AGAGGCTTCA G AIT T I T I TC OGCATCCCGG AGCCCGACCT CAGAAATTTC AATGAAAAGG 
AAAGTCAATG CATCGTGGTC TTGGAAAAGC TGCTTAGACA TGTCTGTTTC 
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40 
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50 



GAACCCGTGC CAGAGCTGTA AGTAACCGCT TCACAGTGCG TGATGAATTG GATGGCTTCG 
GACCCGAGGC AAAAAAAATA ATTGTCTCAT TTTCGTGCTG ATTTGCTTAA CTGGTCGCAC 
CATGCCAGAA AGGCTAGCTG AGAOG CTTTT GGACCTCTGG ACTCCATTAA TAATATTATG 
GATTACTCTT CCCTCTTTTG TGTACATGGC TOOGATGAAT CAGCCTCACG TTTTAACTAC 
TGGATCCCCT TTGGAACTAA GCAGGCAGAC TGAAGAAATC OGCATTTTGA ACCGCTCCAA 
AAGAGGTTGG GTTTGGAATC AAATGTTTGT TCTGGAAGAA TTTTCTGGAC CTGAACCGAT 
TCTCCTTGGC CGGTTACACA CACATCTGGA TCCTGCCAGC AAAAAAATCA AGTATATCCT 
ATCGCGTGAT CCAGCCGCCA CAATCTTTCA AATAAACGAT ATAACTGCAG ACATCCATCC 
TATCAAAAGX CTTGACCGAG AGGAAAAGGC TGAGTATACG TTAACAGCTC AGGCAGTGGA 
CTGGGAGACA AACAAACCTC TCGAGCCTCC TTCTCAATTT ATTATTAAGG TTCAAGACAT 
CAACGACAAT GCCCCCGAGT TTCTCAATGG ACCTTACCAT GCTACTGTTC CAGAGATGTC 
CATCTTGGGT ACATCTGTCA CTAATGTAAC GGCCACTGAT GCTGAOGATC CAGTTTATGG 
AAACAGTCCA AAGTTGCTTT ACAGTATCTT GGAGGGACAG CCGTATTTTT CCATTGAGCC 
TGAAACAGCT ATTATAAAAA CTGCCCTTCC TAACATGGAC AGAGAGGCCA AGGAGGAATA 
CCTGGTTGTA ATTCAAGCCA AAGATATGGG TGGGCATTCC GGTGGTCTGT CTGGAACCAC 
GACACTCACA GTCACGCTTA CCGATGTGAA TGACAATCCT CCAAAATTTG CTCAAAGTTT 
GTATCACTTC TCAGTACCAG AAGATGTGGT CCTTGGCACT GCAATAGGAA GGCTTAAAGC 
CAATGACCAG GATATTGGTC AAAATGCACA ATCTTCCTAT GACATCATTG ATGCAGATGG 
GACAGCACTA TTTGAAATCA C TTCT G ATGC CCAGGCACAG GATGGTGTTA TAAGACTAAG 
AAAGCCTCTG GACTTTGAGA CCAAAAAATC CTATACTCTG AAGGTCCAGG CAGCCAATAT 
CCACATCGAC CCAOGTTTCA GTGCCAGGGG ACCCTTTAAA GATACACCAA CAGTCAAAAT 
T G TT G TAGAG GATGCTGATG AGCCTCCCGT CTTCTCTTCA CCGACTTACC TCCTTGAAGT 
TCATGAAAAT GCTGCCTTGA ACTCTCTGAT TGGCCAAGTG ACAGCTCCTG ACCCTGATAT 
CACTTCCAGC CCAATAAGGT TTTCCATTGA CCGCCACACT GACTTGGAGA GACAGTTCAA 
CATCAATGCA GATGATGCGA AGATAACACT GGCGACCCCA CTCCACAGAG AACTAACTGT 
GTCCCACAAC ATCTCCATCA TTCCTACTGA GATCAGCAAC CACAGTCAGA TATCGCGAGT 
GCCTGTTCCT ATTAAAGTGC TGGATGTCAA TGACAACGCC CCTGAATTCG CGTCCGAATA 
TGACGCATTT TTATCTCAAA ATGGAAAACC CGGCCAAGTA AATATCTCCA TGTTGTTAAT 
ACTGAATATG TTTGTATACA ACTGTTTCCT AGTTAATTAA CCTGCATTAC TTCCTGATTT 
TGCATTCGTT GGATTTACAA ACTCACAGGC AGGAAACTCC TCCAAGCGGT AACAGAAGGG 
AATATTTGTC TTTCTCAGAX GTTAATTCTC TTCTAACTTA CGAACCAATT GGCTCAGAAA 
GTCTGATCAT CTOCTCTCCT CTCACCCCAG CCAAATCACT CTCTTAAAAT ACATCACATA 
TGGGTCATGG CTGGGGACAG TCTTACAGTG CAGAAGGTTG AAATCGCCAT CAATTGGCAA 
GAATCTAAAG AATAGCTCAT GCGAAGCATG CATTTTTCTT TTATGTTGAA AAGAAGATTA 
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ATCCACAAAT GTGGAATGCA AAAAAACACA CTACTTTATA GAAACCTCTA TCTAGTGGTA 2520 

CTTATGTCTC TACACATATT TGCAAGTTTA GTAAACATAA TCTAGACATC AAATTGJTAG 2580 

5 ATATGCCCCT AACCCATTTC AATATG TAG A GGTAAGACTC CTAAGGCATA GATGGGGATA 2640 

ATGAAGACAA AAATAAAGGG CAGAAAAATG TATAAAATAG AACAGACAGA AATACACTAA 2700 

AGATCTAAAG ATAGAAGCAG GAAAGAGGGC AGGGAGGGAG GGACACAGGG CTGGAAGAAG 2760 

W ATAGGGTGGG AGCGAGGGAA GGAGAGTCAA GGCTCAGGGT CTGGGGGGGA AGGTAAAATG 2820 

CAAAACAAAA TCTAGAGAAA CCACTATACT CTGAKTGTGA AAATGCAACT AACCTATGTA 2880 

AAATGACCCA ACCACATGTG TAATAGATTT ATTTTAAOGA GGTGCCGGAG TACTGTATGT 2940 

TTAAGAAATT TATCATTTTT CAACTTCCTA ATTTATTTCT GGATGGTGAC ATTTTAATTT 3000 

AAATAAACAG CAGCTGACAG CATGAAAAAA AAAAAAAAAA AAA 3043 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 532 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES protein 

(Xl) SEQUENCE DESCRIPTIONS SEQ ID NO: 44: 

Met Pro Glu Arg Leu Ala Glu Thr Leu Leu Asp Leu Trp Thr Pro Leu 
15 10 15 

He He Leu Trp He Thr Leu Pro Ser Phe Val Tyr Met Ala Pro Met 
20 25 30 

Asn Gin Ala Bis Val Leu Thr Thr Gly Ser Pro Leu Glu Leu Ser Arg 
35 40 45 

Gin Ser Glu Glu Met Arg He Leu Asn Arg Ser Lys Arg Gly Trp Val 
50 55 60 

35 Trp Asn Gin Met Phe Val Leu Glu Glu Phe Ser Gly Pro Glu Pro He 

65 70 75 80 

Leu Val Gly Arg Leu His Thr Asp Leu Asp Pro Gly Ser Lys Lys He 
85 90 95 
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Lys Tyr He Leu Ser Gly Asp Gly Ala Gly Thr He Phe Gin He Asn 
100 105 110 

Asp He Thr Gly Asp He His Ala He Lys Arg Leu Asp Arg Glu Glu 
115 120 125 

Lys Ala Glu Tyr Thr Leu Thr Ala Gin Ala Val Asp Trp Glu Thr Asn 
130 135 140 

Lys Pro Leu Glu Pro Pro Ser Glu Phe He He Lys Val Gin Asp He 
145 150 155 160 

Asn Asp Asn Ala Pro Glu Phe Leu Asn Gly Pro Tyr His Ala Thr Val 
165 170 175 

Pro Glu Met Ser He Leu Gly Thr Ser Val Thr Asn Val Thr Ala Thr 
180 185 190 
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Asp Ala Aap Asp Pro Val Tyr Cly Aan Ser Ala Lye Leu Val Tyr Ser 
195 200 205 

lie Leu Clu Cly Gin Pro Tyr Phe Ser lie Clu Pro Clu Thr Ala He 
210 215 220 

He Lye Thr Ala Leu Pro Asn Met Asp Arg Clu Ala Lys Clu Clu Tyr 
225 230 235 240 

Leu Val Val He Gin Ala Lys Aap Met Cly Cly His Ser Cly Gly Leu 
245 250 255 

Ser Cly Thr Thr Thr Leu Thr Val Thr Leu Thr Aap Val Aan Aap Aan 
260 265 270 

Pro Pro Lye Phe Ala Gin Ser Leu Tyr His Phe Ser Val Pro Clu Asp 
275 280 285 

Val Val Leu Cly Thr Ala He Cly Arg Val Lys Ala Aan Aap Cln Aap 
290 295 300 

He Gly Glu Aan Ala Gin Ser Ser Tyr Asp He He Asp Gly Aap Gly 
305 310 315 320 

Thr Ala Leu Phe Glu He Thr Ser Aap Ala Cln Ala Gin Aap Gly Val 
325 330 335 

He Arg Leu Arg Lya Pro Leu Aap Phe Glu Thr Lye Lya Ser Tyr Thr 
340 345 350 

Leu Lya Val Glu Ala Ala Aan He Hla He Aap Pro Arc Phe Ser Gly 
355 360 365 

Arg Cly Pro Phe Lya Aap Thr Ala Thr Val Lya He Val Val Clu Aap 
370 375 380 

Ala Aap Clu Pro Pro Val Phe Ser Ser Pro Thr Tyr Leu Leu Glu Val 
385 390 395 400 

Hie Clu Aan Ala Ala Leu Aan Ser Val He Gly Gin Val Thr Ala Arg 
405 410 415 

Aap Pro Asp He Thr Ser Ser Pro He Arg Phe Ser He Aap Arg His 
420 425 430 

Thr Aap Leu Clu Arg Gin Phe Aan He Aan Ala Aep Aap Cly Lya He 
435 440 445 

Thr Leu Ala Thr Pro Leu Asp Arg Glu Leu Ser Val Trp Hie Aan He 
450 455 460 

Ser He He Ala Thr Glu He Arg Aen Hie Ser Gin He Ser Arg Val 
465 470 475 480 

Pro Val Ala He Lye Val Leu Asp Val Aan Asp Aen Ala Pro Glu Phe 
485 490 495 

Ala Ser Glu Tyr Glu Ala Phe Leu Cye Clu Aen Cly Lye Pro Gly Gin 
500 505 510 

Val Aen He Ser Wet Leu Leu He Leu Aen Met Phe Val Tyr Aen Cye 
515 520 525 

Phe Leu Val Aen 
530 
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(2) INFORMATION FOR SEQ 10 NO: 45: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 2490 base pairs 
( B J TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY t linear 

(ii) MOLECULE TYPES CDNA 



(xi) SEQUENCE DESCRIPTION! SEQ ID NO:45i 

GCCACGAGGG CCACTTGACC CACACTCAGA ATTTGTGATC AAAATTCACG AT ATCAACG A 60 

CAATCAGCCT ACATTCCCAG AACAAATTTA TACAGCCAGC GTTCCTCAAA TGTCTGTTCT 120 

AGGTACTTCT GTGCTGCAAC TCACAGCTAC AGATGCCGAT CACCCTTCAT ATGGAAACAG 180 

CCCCAGAGTC ATTTACAGCA TACTTCAAGC GCAGCCTTAT TTCTCTGTGG AACCAGAAAC 240 

AGGTATCATA AGGACAGCTC TACCAAACAT GAACACAGAG AACAAGGAAC AG TACCAGG T 300 

GGTTATTCAA GCCAAGGACA TGGGCGCTCA GATGGGGGGT CTGTCTCGAA CCACCACAGT 360 

CAACATCACT CTCACAGATG TCAACGACAA TCCTCCTCGC TTCCCCCAAA ACACCATCCA 420 

TCTGCGAGTT CTTGAATCCT CTCCAGTTGG CACACCTGTG GGAAGTCTAA AAGCCACOCA 480 

2 5 TOCTGACACG GGGAAAAATC CCGAAGTGGA TTACCCCATT ATTGATGGAG ATGGCACAGA 540 

TATGTTTOAC ATTATAACTO AGAAGGACAC ACAGCAACCC ATCATCACTG TCAAAAAGCC 600 

ACTTGACTAT GAGAACCCAA CACTATATAC TCTGAACCTC CAGGCAGAAA ATACCCATGT 660 

GGATCCACCT TTTTACTATT TAGGGCCATT CAAAGATACA ACAATTGTAA AAATCTCCAT 720 

AGAAGAOGTG GATCAGCCTC CAGTTTTCAG TCGATCCTCC TATCT C TTT G AGGTTCATCA 780 

CGATATTGAA GTCGGCACAA TCATCGGTAC TCTAATGGCA AGAGACCCAG ATTCTACTTC 640 

CAGTCCCATC AGATTTACTT TAGATCGCCA TACTGATCTT CACAGGATCT TTAACATTCA 900 

TTCTGGAAAC GGATCACTTT ATACATCAAA GCCACTTGAT CCTCAACTAT CTCAATGGCA 960 

CAACCTTACC GTCATAGCTG CCGAGATCAA TAATCCTAAA GAAACAACTC GTCTGTCTCT 1020 

TTTTCTGACG ATTTTGGATG TTAATGACAA CGCTCCACAA TTTGCTGTGT TTTATCACAC 1080 

40 ATTTCTATGT GAAAATGCCA GACCAGGACA GCTGATACAG ACAATAAGTG CAGTTGACAA 1140 

AGATGACCCC TTAGGTGGAC AGAAGTTCTT CTTCAGTTTC CCTGCTGTGA ATCCTAACTT 1200 

CACACTGCAA CACAATGAAG ACAACACTCC CAGAATTTTA ACCAGAAACA ATGGCTTCAA 1260 

CCGTCATCAA ATAAGCACCT ACCTACTGCC GCTAGTGATA TCTGATAATG ACTACCCCAT 1320 



TCAGAGCACC ACTGGCACCC TGAOGATCCC TGTTTCCCCC TGTGACAGCC AGGGCAACAT 1380 

GCAGTCCTGC AGTGCCGAAG CCCTGCTCCT TCCTCCTCGC CTCAGCACTC CCGCCTTGAT 1440 

OGCC AT TC TT CTCTGCATCA TCATTCTGCT GCTTATAGTA CTCCTCTTT C CACCCCTGAA 1S00 

AACGCAACGG AAGAAAGAGC CTCTGATTTT ATCCAAAGAA GACATCAGAG ACAACATTGT 1560 

CAGCTATAAC GACGAAGCTG CCGGAGAGGA GGACACCCAA CCCTTTGATA TTGGAACCCT . 1620 



37 



EP1 074 618 A2 



w 



15 



GAGCAATCCT GCACCTATCC AGGAGAAAAA CCTGCGGCCA CATATCATTC CTGAAACCTT 1680 

ATTTATACCG OGGCCGACTC CTAOCCCOCC CGATAACACC GATCTCCGGG ATTTCATTAA 1740 

TGACCCCCTC AAAGAGCACG ACTTGCACCC CACTGCGCCT CCCTACGACT CGCTGQCTAC 1800 

CTATGCCTAT CAACCAAACG ACTCTGTTGC TGAATCTCTC AGCTCCTTAG AATCAGGTAC I860 

CACTGAAGGA GACCAAAACT ACGATTACCT TCGAGAATGG GGGCCTOGGT TTAATAAACT 1920 

AGCAGAAATG TACGCTGGTG GTCAGAGCGA CAAAGAOGCT TAGCCTCGCC CCTGAGCTCT 1980 

GTTCAACGAG ATACGTAACT TTGCAGACAT TGTCTCCACT TCACAATATT TGATATTCAG 2040 

GAGAAAAAAT TCCTGCCACT CAGCACAAGT TTCOCACCTA TTTCTTAATT TGTTCATTAA 2100 

TTATATTAAT TCCTTCCTGT AGAATGTCTC ATGGGATATA TACGACATTT TATTTAATCA 2160 

CTTCCAAGAG CCAAAGCTAT GGAAATTCAA TCTTGCCCAT CTTAGTAAAT AAAAGAAACC 2220 

CGAGCAGGAT AGTTCTCCCT TAAGCAACCT CACGAACAAG TCGCTTCTGT TACATACACC 2280 

TCTTGCCCTT GCAAATGAAG CTTTGAAAAG AOGAAGAAAA CATTTAAGAT GTATCCTGTT 2340 

20 CTGTACATTA AGTTTAAAAA AAAAACTCCA TCTGGTGTTA CTAGCTGTGA TATCCACCCT 2400 

CGTATACGAG CATTCGTCCA ATTTCATTTC ATCAAATTCT ATCTGCTAAT GTTTTATATT 2460 

TATATTTTTG TATTTATTTT TTAAAAAAAA 2490 
25 (2) INFORMATION FOR SEQ 10 NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 653 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGYt linear 

30 (ii) MOLECULE TYPE* protein 

(Xi) SEQUENCE DESCRIPTION* SEQ ID NO*46: 

Ala Arc Cly Pro Val Glu Pro Glu Ser Glu Phe Val He Lya He Hie 
1*5 10 15 
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Asp He Aan Aap Aan Glu Pro Thr Phe Pro Glu Glu He Tyr Thr Ala 
20 25 30 

Ser Val Pro Glu Ket Ser Val Val Gly Thr Ser Val Val Gin Val Thr 
35 40 45 

Ala Thr Aap Ale Aap Acp Pro Ser Tyr Gly Aan Ser Ala Arg Val He 

50 55 60 

Tyr Ser He Leu Gin Gly Gin Pro Tyr Phe Ser Val Glu Pro Glu Thr 
65 70 75 80 

Gly He He Arg Thr Ala Leu Pro Aan Met Aan Arg Glu Aan Lya Glu 
85 90 95 

Gin Tyr Gin Val Val lie Gin Ala Lya Aap Met Gly Gly Gin Met Gly 
100 ; 105 110 

Gly Leu Ser Gly Thr Thr Thr Val Aan He Thr Leu Thr Asp Val Ann 
115 120 125 

Asp Aan Pro Pro Arg Phe Pro Gin Aan Thr He Hie Leu Arg Val Leu 
130 135 140 



38 



EP1 074 618 A2 



Glu Ser Ser Pro Val Gly Thr Ala Val Gly Ser Val Lys Ala Thr Asp 
145 150 155 160 

Ala Asp Thr Gly Lyo Asn Ala Glu Val Aep Tyr Arg He . He Asp Gly 
165 170 175 

Asp Gly Thr Asp Het Phe Asp He He Thr Glu Lys Asp Thr Gin Glu 
180 185 190 



Gly He He Thr Val Lys Lys Pro Leu Asp Tyr Glu Asn Arg Arg Leu 
195 200 205 



15 



Tyr Thr Leu Lys Val Glu Ala Glu Asn Thr His Val Asp Pro Arg Phe 
210 215 220 

Tyr Tyr Leu Gly Pro Phe Lys Asp Thr Thr He Val Lys He Ser He 
225 230 235 240 

Glu Asp Val Asp Glu Pro Pro Val Phe Ser Arg Ser Ser Tyr Leu Phe 
245 250 2SS 

Glu Val His Glu Asp He Glu Val Gly Thr He lie Gly Thr Val Met 
260 265 270 

Ala Arg Asp Pro Asp Ser Thr Ser Ser Pro He Arg Phe Thr Leu Asp 
275 280 285 



25 



Arg His Thr Asp Leu Asp Arg He Phe Asn He His Ser Gly Asn Gly 
290 295 300 

Ser Leu Tyr Thr Ser Lys Pro Leu Asp Arg Glu Leu Ser Gin Trp His 
305 310 315 320 



Asn Leu Thr Val He Ala Ala Glu He Asn Asn Pro Lys Glu Thr Thr 
325 330 335 



30 



Arg Val Ser Val Phe Val Arg He Leu Asp Val Asn Asp Asn Ala Pro 
340 345 350 



Gin Phe Ala Val Phe Tyr Asp Thr Phe Val Cys Glu Asn Ala Arg Pro 
355 360 365 

Gly Gin Leu Zle Gin Thr Zle Ser Ala Val Asp Lys Asp Asp Pro Leu 
370 375 380 

Gly Gly Gin Lys Phe Phe Phe Ser Leu Ala Ala Val Asn Pro Asn Phe 
385 390 395 400 

Thr Val Gin Asp Asn Glu Asp Asn Thr Ala Arg Zle Leu Thr Arg Lys 
405 410 415 

Asn Gly Phe Asn Arg His Glu Zle Ser Thr Tyr Leu Leu Pro Val Val 
420 425 430 
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Zle Ser Asp Asn Asp Tyr Pro Zle Gin Ser Ser Thr Gly Thr Leu Thr 
435 440 445 



Zle Arg Val Cys Ala Cys Asp Ser Gin Gly Asn Met Gin Ser Cys Ser 
450 455 460 



50 



Ala Glu Ala Leu Leu Leu Pro Ala Gly Leu Ser Thr Gly Ala Leu Zle 
465 * " 470 475 480 

Ala Zle Leu Leu cys Zle Zle Zle Leu Leu Val Zle Val Val Leu Phe 
485 490 495 
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Ala Ala Leu Lys Arg Gin Arg Lys Lys Glu Pro Leu lie Leu Ser Lys 
500 505 510 

Glu Asp He Arg Aep Aen He Val Ser Tyr Aen Asp Glu Gly Gly Gly 
515 520 525 

Glu Glu Asp Thr Gin Pro Phe Aop He Gly Thr Leu Arg Aon Pro Ala 
530 535 540 

Ala He Glu Glu Lye Lye Leu Arg Arg Asp He He Pro Glu Thr Leu 
545 550 S55 560 

Phe lie Pro Arg Arg Thr Pro Thr Ala Pro Aop Asn Thr Asp Val Arg 
565 570 575 

Aep Phe He Asn Glu Arg Leu Lye Glu His Asp Leu Asp Pro Thr Ala 

580 585 590 

Pro Pro Tyr Aep Ser Leu Ala Thr Tyr Ala Tyr Glu Cly Asn Asp Ser 
595 600 60S 

Val Ala Glu Ser Leu Ser Ser Leu Glu Ser Gly Thr Thr Glu Gly Asp 
610 615 620 

Cln Asn Tyr Asp Tyr Leu Arg Glu Trp Cly Pro Arg Phe Asn Lyo Leu 
625 630 635 640 

Ala Glu Met Tyr Cly Gly Gly Glu Ser Asp Lys Asp Ala 
645 6S0 

(2) INFORMATION FOR SEQ IP NOl47t 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH x 3048 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDEDNESSt single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE* cDKA 



<xi) SEQUENCE DESCRIPTIONS SEQ ID NO:47: 
CCOOGGCGCC GAAGATGACC COGGCOGCCG CCCTCCTCCT TCTGCTGCTC TCGCTCTCCG 
GCOCGCTCCG GCCCCATAAT GAGGATCTTA CAACTAGAGA GACCTCCAAG CCTGCGTTCT 
CTGAAGATCA TTACACGCCA TTAATCTCCC AAAATATTCT AGAAGGGGAA AACCTACTTC 
AAGTCAAGTT CAGCACCTGT GTGGGGACCA ACGCCACACA ATATGAGACC AACAGCATGG 
ACTTCAAACT TGGGGCACAT GCGACAGTCT TCCCCACCCG GGAGCTGCAG GTCCCCTCCC 
AGCAGCTGCC GTTCACGGTC ACTCCATGCG ACACCCACAC AGCAGAGAAA TGCCACGCCC 
TGGTCCGGTT GCTGGTGGCC CAGACCTCGT CCCCCCACTC TGGACACAAG CCGCAGAAAG 
GAAACAAGGT CGTCCCTCTG CACCCCTCTC CGCCTCCCAA GCACACCCTG CTCCCCTGCC 
OCCACCACCA CAACGCCAAC GCGCTGAGCC GGCCCAAACC GGACTGGGTC ATCCCACCCA 
TCAACGTOCC CGAGAACTCO OGCGGGCCCT TCCOOCAGCA GCTCGTGAGG ATCCGGTCCG 
ACAAAGACAA TGACATCCCC ATCCGGTACA GCATCAOCGG AGTGGGTGCC GACGACCCCC 
CCATCGAGGT CTTCACCATT AACTCCATCT COCCCOCCAT CTACGTCACA AGCCCCATGG 
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ACCGGGAGGA CCACGCCTCT TACCACCTCC GAGCCCACGC TGTGGACATG AATGGCAACA 780 

AGGTGGAGAA CCCCATCGAC CTCTACATCT ACGTCATCGA CATCAATGAC AACCACCCTC 840 

ACTTCATCAA CCACGTCTAC AACTCCTCCG TCGACCACCC CTCCAAGCCA GGCACCTACG 900 

TCATCACCAT CACCGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG ATGGTGCGGT 960 

ACOGCATCGT GACCCAGACC CCACAGAGCC CGTCCCAGAA TATCTTCACC ATCAACAGCG 1020 

AGACTGGAGA TATCGTCACA GTCGCGGCTG GCTGGGACCG AGAGAAAGTT CAGCAGTACA 1080 

CAGTCATCGT TCAGGCCACA GATATCCAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

CCACAGCCAT CATCACGGTG ACAGATCTCA ATCACAACCC GTCAGAATTT ACCGCCAGCA 1200 

CCTTTGCAGG GCACGTCCCC GAAAACAGCG TCGAGACOGT GGTOGCAAAC CTCACGGTCA 1260 

TGGACOGAGA TCAGCCCCAC TCTCCAAACT GGAATCCCCT TTACOGCATC ATCAGTGGGG 1320 

ATCCATCCGG GCACTTCACC GTCCGCACAG ACCCCCTAAC CAACGAGGGC ATGGTCACCG 1380 

TGGTGAAGGC AGTCGACTAC CAGCTCAACA GAGCTTTCAT GCTGACAGTG ATGGTGTCCA 1440 

ACCAGGCCCC CCTGGCCAGC GG AATCCAGA TGTCCTTCCA GTCCACGGCA GGGGTGACCA 1500 

TCTCCATCAT GGACATCAAC GAGCCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 1560 

TGGAGGAGGG CGTGCCCCCC GGCACCGTCC TGACCAOGTT TTCAGCTGTC GACCCTGACC 1620 

CGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA COCAGCCAGC TGGCTGCACA 1680 

TCAATCCCAC CAACGGCCAG ATCACCAOGG TCGCAGTCCT GGACCGTGAC TCCCTCTACA 1740 

CCAAAAACAA OCTCTAOGAG GCCACCTTCC TGGCACCTGA CAATGGGATA CCCCCGGCCA 1800 

CCGGCACCCG GACCCTCCAC ATCTATCTCA TTCACATCAA CGACAACGCC CCTCAGCTCQ I860 

TGCCCAAGGA GGOCCAGATC TCCGAGAGCC CCAACCTOAA CGCCATCAAC ATCACGGCGG 1920 

CCGACGCTCA OGTGCACCCC AACATCGGCC CCTA O GTCTT CGACCTGCCC TTTCTCCCGG 1980 

OGGCCGTGOG GAAGAACTGG ACCATCACCC GCCTGAAOGG TCACTATGCC CAACTCACCT 2040 

TGCGCATCCT GTACCTGGAG GCCGGGATCT ATGACGTCCC CATCATCGTC ACAGACTCTC 2100 

GAAACCCTCC CCTGTCCAAC ACGTCCATCA TCAAAGTCAA GGTGTGCCCA TGTGATGACA 2160 

ACGGGGACTG CACCACCATT GGCGCAGTGG GAGOGGCTGG TCTGGGCACC . GGTGCCATOG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCATGGT CCTGCTCTTT GTCATCTGGA 2280 

TGAAGCCGCC AGAGAAGGAG CCCCACACGA AGCAGCTGCT CATTGACCCC GAGGAOGAOG 2340 

TCCGCGAAAA CATCCTCAAG TATGACGAGG AAGGCGGTGG OGAGGAGGAC CAGGACTACG 2400 

ACCTCAGCCA GCTGCAGCAG COGGAAGCGA TGGGGCACCT GCCAAGCAAA GCCCCTGGCG 2460 

TGOGTCQOGT CGATGAGCGC COGGTGGGCC CTGACCCCCA GTACCOGATC AGGCCCATGG 2520 

TGCOGCACCC AGGCGACATC GGTGACTTCA TCAATGAGCC ACTCCGOCCT GCTGACAACG • 2580 

ACCCCACGGC ACCCCCCTAT CACTCCCTCC TGGTCTTCGA CTAOGAGGGG AGCGCCTCCA 2640 

COGCAGGCTC OGTCAGCTCC CTCAACTCAT CGACTTCOGG GGACCAAGAC TAOGATTACC 2700 

TCAAOGACTG CGCCCCCAGA TTCAAGAAGC TGGOGGACAT CTATOGAGGT GGTGAAGAGG 2760 
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ATTGACTGAC CTCGCATCTT CGCACCGAAG TCAGACCOCT CCTCCCACCC CCCACGACCA 2820 

CCACTGACCA GAGCOGCCCG GTCTTCCCGA CTCCCTCCGG CTGTGTCCTT AGTGCTGTTA 2880 

CCAGGCCCCC CAATCCCCAC GTTGACCTGT CTAGCATGAG CACCCACCCC CACAGCGCCC 2940 

TGCACCOGGC CGCTGCCCAG CACCGCGCTG GCTGGCACTG AAGCACAGCA AGAGGCACTC 3000 

TGTCTTCACT TGAATTTCCT AGAACAGAAG CACTGTTTTT AAAAAAAG 3048 
(2) INFORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 916 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4B: 

Met Thr Ala Cly Ala Cly Val Leu Leu Leu Leu Leu Ser Leu Ser Cly 
15 10 15 

Ala Leu Arg Ala His Asn Glu Asp Leu Thr Thr Arg Glu Thr Cys Lys 
20 25 30 

Ala Gly Phe Ser Glu Asp Asp Tyr Thr Ala Leu lie Ser Gin Asn lie 
35 40 45 

Leu Glu Gly Glu Lys Leu Leu Gin Val Lys Phe Ser Ser Cys Val Gly 
50 5S 60 

Thr Lys Gly Thr Gin Tyr Glu Thr Asn Ser Met Asp Phe Leu Val Gly 
65 70 75 80 

Ala Asp Gly Thr Val Phe Ala Thr Arg Glu Leu Gin Val Pro Ser Glu 
85 90 95 

Gin Val Ala Phe Thr Val Thr Ala Trp Asp Ser Gin Thr Ala Glu Lys 
100 105 110 

Trp Asp Ala Val Val Arg Leu Leu Val Ala Gin Thr Ser Ser Pro His 
115 120 125 

Ser Gly His Lys Pro Gin Lys Gly Lys Lys Val Val Ala Leu. Asp Pro 
130 135 140 

Ser Pro Pro Pro Lys Asp Thr Leu Leu Pro Trp Pro Gin His Gin Asn 
14S 150 155 160 

Ala Asn Gly Leu Arg Arg Arg Lys Arg Asp Trp Val He Pro Pro He 
165 170 175 

Asn Val Pro Glu Asn Ser Arg Gly Pro Phe Pro Gin Gin Leu Val Arg 
180 185 190 

He Arg Ser Asp Lys Asp Asn Asp He Pro He Arg Tyr Ser He Thr 

19S i 200 205 

Gly Val Gly Ala Asp Gin Pro Pro Met Glu Val Phe Ser He Asn Ser 
210 215 220 

Met Ser Gly Arg Met Tyr Val Thr Arg Pro Met Asp Arg Glu Glu His 
225 230 235 240 
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Ala Ser Tyr Hie Leu Arg Ala Hie Ala Val Asp Met Aen Gly Asn Lye 
245 250 255 

Val Glu Aen Pro lie Asp Leu Tyr lie Tyr Val He Aep Met Aen Asp 

5 260 265 270 

Asn Hie Pro Glu Phe He Asn Gin Val Tyr Asn Cys Ser Val Aep Glu 

275 260 285 

Gly Ser Lys Pro Gly Thr Tyr Val Met Thr He Thr Ala Asn Asp Ala 
w 290 295 300 

Asp Asp ser Thr Thr Ala Asn Gly Met Val Arg Tyr Arg He Val Thr 
305 310 315 320 



Gin Thr Pro Gin Ser Pro Ser Gin Asn Met phe Thr He Asn Ser Glu 
325 330 335 

Thr Gly Asp He Val Thr Val Ala Ala Gly Trp Asp Arg Glu Lys Val 
340 345 350 

Gin Gin Tyr Thr Val He Val Gin Ala Thr Asp Met Glu Gly Asn Leu 
355 360 365 

Asn Tyr Gly Leu Ser Asn Thr Ala Thr Ala He He Thr Val Thr Asp 
370 375 380 

Val Asn Asp Aen Pro Ser Glu Phe Thr Ala Ser Thr Phe Ala Gly Glu 
385 390 395 400 

Val Pro Glu Asn Ser Val Glu Thr Val Val Ala Asn Leu Thr Val Met 
405 410 415 

Asp Arg Asp Gin Pro Bis Ser Pro Asn Trp Asn Ala Val Tyr Arg He 
420 425 430 

He Ser Gly Asp Pro Ser Gly Hie Phe Ser Val Arg Thr Asp Pro Val 
435 440 445 

Thr Asn Glu Gly Met Val Thr Val Val Lys Ala Val Asp Tyr Glu Leu 
450 455 460 

35 Asn Arg Ala Phe Met Leu Thr Val Met Val Ser Asn Gin Ala Pro Leu 

465 470 475 480 

Ala Ser Gly He Gin Met Ser Phe Gin Ser Thr Ala Gly Val Thr He 
485 490 495 



15 



20 



25 



30 



40 



Ser He Met Asp He Asn Glu Ala Pro Tyr Phe Pro Ser Asn His Lys 
500 505 510 

Leu He Arg Leu Glu Glu Gly Val Pro Pro Gly Thr Val Leu Thr Thr 

515 520 525 

Phe ser Ala Val Asp Pro Asp Arg Phe Met Gin Gin Ala Val Arg Tyr 

45 530 535 540 

Ser Lys Leu ser Asp Pro Ala Ser Trp Leu His He Asn Ala Thr Asn 
545 550 55S 560 

Gly Gin- lie Thr Thr Val Ala Val Leu Asp Arg Glu Ser Leu Tyr Thr 

so ' 565 570 575 

Lys Asn Asn Val Tyr Glu Ala Thr Phe Leu Ala Ala Asp Asn Gly He 
580 585 590 
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Pro Pro Ala Ser Gly Thr Cly Thr Leu Gin He Tyr Leu He Asp He 
595 600 605 

Asn Aep Asn Ala Pro Clu Leu Leu Pro Lye Glu Ala Gin lie eye Clu 
610 615 620 

Arg Pro Aen Leu Aen Ala He Aim He Thr Ala Ala Asp Ala Asp Val 

635 aao 



625 



630 



640 



Hie Pro Aen He Gly Pro Tyr Val Phe Glu Leu Pro Phe Val Pro Ala 
645 650 655 

Ala Val Arg Lye Aen Trp Thr lie Thr Arg Leu Aen Gly Aep Tyr Ala 
660 665 670 

Gin Leu Ser Leu Arg He Leu Tyr Leu Glu Ala Gly Met Tyr Aep Val 
675 680 685 

Pro tie He Val Thr Aep Ser Gly Aen Pro Pro Leu Ser Aen Thr Ser 
690 695 700 

He He Lye Val Lye Val Cye Pro Cye Asp Asp Aen Gly Aep Cys Thr 
705 * 710 715 720 

Thr He Gly Ala Val Ala Ala Ala Gly Leu Gly Thr Cly Ala He Val 

725 730 735 

Ala He Leu He Cye He Leu He Leu Leu Thr Met Val Leu Leu Phe 
740 745 750 

Val Met Trp Met Lye Arg Arg Glu Lye Glu Arg Hie Thr Lye Gin Leu 
755 760 765 

Leu He Aep Pro Glu Aep Aep Val Arg Glu Lye He Leu Lye Tyr Aep 

770 775 780 

Glu Glu Gly Gly Gly Glu Glu Asp Gin Asp Tyr Asp Leu Ser Gin Leu 
785 790 795 800 

Gin Gin Pro Clu Ala Met Gly His Val Pro Ser Lys Ala Fro Gly Val 
80S 810 815 

Arg Arg Val Aep Clu Arg Pro Val Gly Pro Glu Pro Gin Tyr Pro He 
* * 820 825 830 

Arg Pro Met Val Pro Hie Pro Gly Asp He Gly Asp Phe He Asn Glu 
835 840 845 

Cly Leu Arg Ala Ala Asp Asn Asp Pro Thr Ala Pro Pro Tyr Asp Ser 
850 855 860 

Leu Leu Val Phe Aep Tyr Glu Gly Ser Gly Ser Thr Ala Gly Ser Val 
865 870 875 880 

Ser Ser Leu Asn Ser Ser Ser Ser Gly Aep Gin Asp Tyr Asp Tyr Leu 
885 890 895 

Asn Asp Trp Gly Pro Arg Phe Lye Lys Leu Ala Asp Met Tyr Gly Gly 
900 905 910 

. Gly Clu Glu Asp 
915 
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(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: single 

(D) topology : linear 

(ii) MOLECULE TYPE: cDNA 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CTCCACTCAC CCTCAGCCCT CGACGCACAG CCAGTCCAAC GGAACAGAAA CATCCCTCAG 60 

CCCACAGGCA CGATCTGTTC CTCCTCCGAA GATGCACAGG CTATGATGCT CCTCGCCACA 120 

TCGGGCGCCT CCCTGGGCCT GCTGGCAGTG CCAGCAGTGG CAGCAGCACC TGCTAACCCT 180 

GCCCAACGGG ACACCCACAG CCTGCTGCCC ACCCACCGGC GCCAAAAGAG AGATTGGATT 240 

TGGAACCAGA TCCACATTGA TGAAGAGAAA AACACCTCAC TTCCCCATCA TCTAGGCAAG 300 

20 

ATCAACTCAA CCCTGAGTCG CAAGAATGCC AAGTACCTGC TCAAACGACA ATATCTCCGC 360 

AAGCTCTTCC GGGTCGATCC AGAGACAGGA GAC C T G TTCG CCATTGAGAG GCTGGACCGG . 420 

GAGAATATCT CAGAGTACCA CCTCACTGCT GTCATTCTGG ACAAGGACAC TGCCGAAAAC 480 

25 CTCCAGACTC CTTCCAGCTT CACCATCAAA CTTCATGACG TGAACGACAA CTGGCCTGTG S40 

TTCAOGCATC G G TT G TTCAA TGCGTCCCTC CCTGACTCGT CGGCTCTGCG GACCTCACTC 600 

ATCTCTCTGA CACCAGTGGA TGCAGACGAC CCCACTGTGG GAGACCAOCC CTCTCTCATG 660 

^ TACCAAATCC TGAAGGGGAA AGAGTATTTT CCCATOCATA ATTCTGGACG TATTATCACJV 720 

ATAACGAAAA GCTTGGACCG AGAGAAGCAC GCCAGGTATG AGATCGTGGT GGAAGCGCGA 780 

GATGCCCAGG CCCTCCGCGC GGACTCCCCC AOCGCCACCC TGCTCGTCAC TCTGCAAGAC 840 

ATCAATGACA ACTTCCCCTT CTTCACCCAG ACCAACTACA CATTTCTCCT CCCTCAAGAC 900 

35 

ACCCCTCTGC GCACCTCTCT GGGCTCTCTG TTTCTTGAGC ACCCACATCA GCCCCAGAAC 960 

OGGATGACCA AGTACAGCAT CTTGOGGGGC GACTACCAGG ACCCTTTCAC CATTGAGACA 1020 

AACCCCGCCC ACAACGAGGG CATCATCAAG CCCATCAAGC CTCTGGATTA TGAATACATC 1080 

40 CAGCAATACA GCTTCATAGT CCAGCCCACA CACCCCACCA TCGACCTCCG ATACATCAGC 1140 

CCTCCCGCCG GAAACACAGC CCAGGTCATT ATCAACATCA CAGATCTGGA CGACCCCCCC 1200 

ATTTTCCAGC AGCCTTTCTA CCACTTCCAG CTGAAGGAAA ACCAGAAGAA GCCTCTGATT 1260 

45 GGCACACTGC TGGCCATGGA CCCTGATGCG GCTAGGCATA GCATTGGATA CTCCATCCGC 1320 

AGGACCAGTG ACAAGGGCCA GTTCTTCCGA GTCACAAAAA AGGGCCACAT TTACAATCAG 1380 

AAACAACTCG ACAGAG AAGT CTACCCCTGG TATAACCTGA CTCTGCAGGC CAAAGAACTG 1440 

CATTCCACTG GAACCCCCAC AGGAAAACAA TCCATTCTGC AACTCCACAT TGAAGTTTTG 1500 

50 

GATCAGAATG ACAATGCCCC GGA GTTTG CC AAGCCCTACC AGCCCAAACT GTGTGAGAAC 1560 

GCTGTCCATG GCCAGCTGCT CCTGCAGATC TCCGCAATAC ACAAGGACAT AACACCACGA 1620 
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AACGTCAAGT TCAAATTCAT CTTGAATACT GAGAACAACT TTACCCTCAC GGATAATCAC 1680 

GATAACACGG CCAACATCAC AG TCAAGTAT GCGCAGTTTC ACCGGGAGCA TACCAAGGTC 1740 

CACTTCCTAC CCGTGGTCAT CTCAGACAAT GGGATGCCAA GTCCCACGGG CACCAGCACG 1800 

CTGACCGTGG CCCTGTGCAA GTOCAACGAG CAGGGOGAGT TCACCTTCTG CGAGGATATG 1860 

GCCGCCCAGG TGGGCGTGAG CATCCAGGCA GTGGTAGCCA TCTTACTCTG CATCCTCACC 1920 

ATCACAGTGA TCACCCTGCT CATCTTCCTG GGGCGGCCGC TCCGGAAGCA GGOCCGCGCG 1980 

CAOGGCAAGA GCGTGCCGCA GATCCACGAG CAGCTGGTCA CCTACGAOGA CGAGGGCGGC 2040 

GGCGAGATCC ACACCACCAG CTACGATCTG TCGGTGCTCA ACTCCCTGOG COGCGGCGGC 2100 

GCCAAGCCCC CGCGGCCOGC GCTCCACGCC OGCOCTTCCC TCTATGOGCA GCTGCAGAAG 2160 

CCACOGAGGC ACGCGCCTGG GGCACACGGA GGCCCCGCGG AGATGGCAGC CATCATCGAG 2220 

GTGAAGAAGG AOGAGGCGGA CCACGACGCC GACGGCCCCC CCTACGACAC GCTCCACATC 2280 

TACGGCTACG AGGGCTCCGA GTCCATAGCC GAGTCCCTCA GCTCCCTCCG CACCGACTCA 2340 

TCCGACTCTG ACGTGGATTA CGACTTCCTT AACGACTGGG GACCCAGGTT TAAGATGCTG 2400 

GCTGAGCTGT AOGGCTCGGA CCCCCGGGAG GAGCTGCTGT ATTAGCCGGC CGAGGTCACT 2460 

CTGGGCCTGG GGACCCAAAC CCCCTGCAGC CCAGGCCAGT CAGACTCCAG GCAOCACAGC 2520 

CTCCAAAAAT GGCAGTCACT CCCCAGCCCA GCACCCCTTC CTCGTGGCTC CCAGAGACCT 2580 

CATCAGCCTT GGGATAGCAA ACTCCAGGTT CCTCAAATAT CCACGAATAT ATGTCAGTGA 2640 

TGACTATTCT CAAATGCTGC CAAATCCAGG CTGCTCTTCT GTCTGGGCTC AGACATCCAC 2700 

ATAACCCTGT CACCCACAGA CCCCCGTCTA ACTCAAAGAC TTCCTCTGGC TCCCCAAGGC 2760 

TCCAAAGCAA AACAGACTGT GTTTAACTGC TGCAGGCTCT TTTTCTACGG TCCCTGAACG 2820 

CCCTGCTAAC GCTCGTGACG TCCTCGTGCC TATCXGCCTC GAGGCAAAGG CCTGGACAGC 2880 

TTGACTTCTG GGGCAGGATT CTCTGCAGCC CATTCCCAAG GGAGACTGAC CATCATGCCC 2940 

TCTCTCGCGA CCCCTAGCCC TGCTCCAACT CCATACTCCA CTCCAAGTGC CCCACCACTC 3000 

CCCAACCCCT CTCCACGCCT CTCAAGACCG ACCAAGGGGC CCCATGGCAG CTCCTGACCT 3060 

TGGCTCCTGA ACTGACCTCA CTCGCCTGCC ATGCCAGTAA CTGTGCTCTA CTCAGCACTG 3120 

AACCACATTC AGGCAAATCG CTTATTAAAC TTTGAAGCAA CTGT 3164 
(2) INFORMATION FOR SBQ ID NOxSO: 

(1) SEQUENCE CHARACTERISTICS x 

(A) LENGTH t 780 amino acids 

(B) TTPEi amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE i protein 

(xi) SEQUENCE DESCRIPTION x SEQ ID NO: 50: 

Met Met Leu Leu Ala Thr Ser Gly Ala Cys Leu Gly Leu Leu Ala Val 
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Ala Ala Val Ala Ala Ala Cly Ala Asn Pro Ala Gin Arg Asp Thr His 
20 25 30 

Ser Leu Leu Pro Thr His Arg arg Gin Lys Arg Asp Trp lie Trp Asn 
35 40 45 

Gin Met His He Asp Glu Glu Lys Asn Thr ser Leu Pro His 'His Val 
50 55 60 ' 

Cly Lye He Lys Ser Ser Val Ser Arg Lys Asn Ala Lys Tyr Leu Leu 

10 65 70 75 80 

Lys Gly Glu Tyr Val Gly Lys Val Phe Arg Val Asp Ala Glu Thr Gly 
85 90 95 



15 



20 



25 



30 



Asp Val Phe Ala He Glu Arg Leu Asp Arg Glu Asn He Ser Glu Tyr 
100 105 HO 

His Leu Thr Ala Val He Val Asp Lys Asp Thr Gly Glu Asn Leu Glu 
115 120 125 

Thr Pro Ser Ser Phe Thr He Lys Val His Asp Val Asn Asp Asn Trp 
130 135 140 

Pro Val Phe Thr His Arg Leu Phe Asn Ala Ser Val Pro Glu Ser Ser 
145 150 155 160 

Ala Val Gly Thr Ser Val He Ser Val Thr Ala Val Asp Ala Asp Asp 
165 170 175 

Pro Thr Val Cly Asp His Ala Ser Val Met Tyr Gin He Leu Lys Gly 
180 185 190 

Lye Glu Tyr Phe Ala He Asp Asn Ser Gly Arg He He Thr He Thr 
195 200 205 

Lys Ser Leu Asp Arg Glu Lys Gin Ala Arg Tyr Glu He Val Val Glu 
210 215 220 

Ala Arg Asp Ala Gin Cly Leu Arg Gly Asp Ser Gly Thr Ala Thr Val 
225 230 235 240 

35 Leu Val Thr Leu Gin Asp He Asn Asp Asn Phe Pro Phe Phe Thr Gin 

245 250 255 

Thr Lys Tyr Thr Phe Val Val Pro Glu Asp Thr Arg Val Gly Thr Ser 
260 265 270 

40 Val Cly Ser Leu Phe Val Glu Asp Pro Asp Glu Pro Gin Asn Arg Met 

275 280 285 

Thr Lys Tyr ser He Leu Arg Cly Asp Tyr Gin Asp Ala Phe Thr He 
290 295 300 

Glu Thr Asn Pro Ala Hie Asn Glu Gly He He Lys Pro Met Lys Pro 

45 305 310 315 320 

Leu Asp Tyr Glu Tyr He Gin Gin Tyr Ser Phe He Val Glu Ala Thr 
325 330 335 

Asp Pro -Thr He Asp Leu Arg Tyr Met Ser Pro Pro Ala Gly Asn Arg 
50 v 340 345 350 

Ala Gin Val He He Asn He Thr Asp Val Asp Glu Pro Pro He Phe 
355 360 365 
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Gin Gin Pro Phe Tyr His Phe Gin Leu Lya Glu Asn Gin Lye Lye Pro 
370 375 380 

Leu He Gly Thr Val Leu Ala Met Aep Pro Asp Ala Ala Arg Hie Ser 
385 390 395 400 

He Gly Tyr Ser He Arg Arg Thr Ser Asp Lys Gly Gin Phe Phe Arg 
405 410 415 

Val Thr Lye Lye Gly Asp He Tyr Asn Glu Lye Glu Leu Asp Arg Glu 
420 425 430 

Val Tyr Pro Trp Tyr Asn Leu Thr Val Glu Ala Lys Glu Leu Asp Ser 
435 440 445 

Thr Gly Thr Pro Thr Gly Lys Glu Ser He Val Gin Val His He Glu 
450 455 460 

Val Leu Asp Glu Asn Asp Asn Ala Pro Glu Phe Ala Lys Pro Tyr Gin 
465 ~ 470 475 480 

Pro Lys Val Cys Glu Asn Ala Val His Gly Gin Leu Val Leu Gin He 
485 490 495 

Ser Ala He Asp Lys Asp He Thr Pro Arg Asn Val Lys Phe Lys Phe 
500 505 510 

He Leu Asn Thr Glu Asn Asn Phe Thr Leu Thr Asp Asn His Asp Asn 

515 520 525 

Tiir Ala Asn He Thr Val Lys Tyr Gly Gin Phe Asp Arg Glu His Thr 
530 535 540 

Lys Val His Phe Leu Pro Val Val He ser Asp Asn cly Met Pro Ser 
545 550 555 560 

Arg Thr Cly Thr Ser Thr Leu Thr Val Ala Val Cys Lys Cys Asn Glu 
565 570 575 

Gin Gly Glu Phe Thr Phe Cys Glu Asp Met Ala Ala Gin Val Gly Val 
580 585 590 

Ser He Gin Ala Val Val Ala He Leu Leu Cys He Leu Thr He Thr 
595 600 605 

Val He Thr Leu Leu He Phe Leu Arg Arg Arg Leu Arg Leu Gin Ala 
610 615 620 

Arg Ala His Gly Lys Ser Val Pro Glu He His Glu Gin fceu Val Thr 
625 630 635 640 

Tyr Asp Glu Glu Gly Cly Gly Glu Met Asp Thr Thr Ser Tyr Asp Val 
645 650 655 

Ser Val Leu Asn Ser Val Arg Arg Gly Gly Ala Lys Pro Pro Arg Pro 
660 665 670 

Ala Leu Asp Ala Arg Pro Ser Leu Tyr Ala Gin Val Gin Lys Pro Pro 
675 680 685 

Arg His Ala Pro Gly Ala His Cly Gly Pro Cly Clu Met Ala Ala Met 
690 695 700 

He Glu Val Lys Lys Asp Clu Ala Asp His Asp Gly Aap Gly Pro Pro 
705 710 715 720 
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Tyr Asp Thr Leu His Zle Tyr Gly Tyr Glu Gly Ser Glu ser He Ala 
725 730 735 

Glu Ser Leu Ser Ser Leu Gly Thr Asp Ser Ser Asp Ser Asp Val Asp 
740 745 750 

Tyr Asp Phe Leu Asn Asp Trp Gly Pro Arg Phe Lys Met Leu Ala Glu 
755 760 765 

Leu Tyr Gly Ser Asp Pro Arg Glu Glu Leu Leu Tyr 
770 775 780 

(2) INFORMATION FOR SEQ ID NO: 51: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1369 base pairs 

(B) TYPE: nucleic acid 

75 (C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

TCTAGATGAG OCACCTCTCT TCAGCAAACT GCCCTACATC TTACAAATAA GAGAACATGC 60 

TCAGATAAAC ACCACAATAG GCTCCGTCAC AGCCGAAGAT CCAGATGCTG CCAGGAATCC 120 

25 TCTCAAGTAC TCTATAGATC GACACACAGA TATGCACAGA ATATTCAACA TTGATTCTCG 180 

AAATGCTTCG ATTTTTACAT CGAAACTTCT TGACCGAGAA ACACTGCTAT GGCACAACAT 240 

TACACTGATA GCAACAGAGA TCAATAATCC AAAGCAAAGT AGTCGAGTAC CTCT AT AT AT 300 

TAAA G TTCTA GATGTCAATG ACAACCCCCC AGAATTTGCT CAGTTCTATG AAACTTTTCT 360 

30 

CTGTGAAAAA GCAAAGGCAG ATCAGTTCAT TCAGACCTTG CATGCTGTTA GCAAGGATGA 420 

CCCTTATAGT GGGCACCAAT TTTCGTTTTC CTTGCCCCCT GAAGCAGCCA GTGGCTCAAA 480 

CTTTACCATT CAAGACAACA AAGACAACAC GGCGGGAATC TTAACTCGGA AAAATGCCTA 540 

35 TAATAGACAC GAGATGAGCA CCTATCTCTT CCCTGTCGTC ATTTCAGACA AOGACTACCC 600 

AGTTCAAAGC AGCACTGGGA CAGTGACTGT CCGGGTCTGT GCATGTGACC ACCACGGGAA 660 
CATGCAATCC TGCCATCCGG ACGCGCTCAT CCACCCCACG GGACTCAGCA CGCCGGCTCT 720 

40 GGTTGCCATC CTTCTCTGCA TCGTGATCCT ACTAGTGACA GTGGTGCTGT TTGCACCTCT 780 

GAGGCGGCAG CGAAAAAAAG AGCCTTTGAT CATTTCCAAA GAGGACATGA GAGATAACAT 840 
TCTCAGTTAC AAOCACGAAG GTCGTGGAGA GGAGGACACC CACGCTTTTC ATATOGGCAC 900 
CCTGAGGAAT CCTGAAGCCA TAGAGGACAA CAAATTACGA AGGGACATTG TGCCCGAAGC 960 

45 

CCTTTTCCTA . CCCCGAOGGA CTCCAACACC TCGCGACAAC ACCCATGTCA GAGATTTCAT 1020 

TAACCAAAGG TTAAAGGAAA ATCACAOGGA CCOCACTGCC CCGCCATAOG ACTOCCT G GC 1080 

CACTTACGCC TATGAAGGGA CTGCCTCCGT GGCGGATTCC CTGACCTCGC TGGAGTCAGT 1140 

GACCACCGAT GCAGATCAAG ACTATGATTA CC T T T A CT G A CTGGGACCTC GATTCAAAAA 1200 

GCTTCCAGAT ATCTATGGAG CAGTGGACAG TGACAAAGAC TCCTAATCTG T T CC CTT T T I 1260 
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CATTTTCCAA TACGACACTG AAATATGTGA AGTGCCTATT TCTTTATATT TATCCACTAC 
TCOCTCAAGG CTTCTCTCTT CTACCCCTTC CAAAAGCCAA TGGCTGCAC 
(2) INFORMATION FOR SEQ ID NO: 52: 

.* 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 414 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE* protein 



(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 52: 

Val Asp Glu Pro Pro Val Phe Ser Lys Leu Ala Tyr He Leu Cln He 
15 10 15 

Arg Clu Asp Ala Gin He Asn Thr Thr He Gly Ser Val Thr Ala Gin 
20 25 30 

Asp Pro Aep Ala Ala Arg Asn Pro Val Lys Tyr Ser He Lys Arg His 
35 40 45 

Thr Asp Met Asp Arg He Phe Asn He Asp ser Gly Asn Gly ser He 
50 55 60 

Phe Thr Ser Lys Leu Leu Lys Arg Clu Thr Leu Leu Trp His Asn He 
65 70 75 80 

Thr Val He Ala Thr Clu He Asn Asn Pro Lys Cln ser Ser Arg Val 
85 90 95 

Pro Leu Tyr He Lya Val Leu Asp Val Asn Asp Asn Ala Pro Clu Phe 
100 105 HO 

Ala Clu Phe Tyr Clu Thr Phe Val Cys Glu Lys Ala Lys Ala Asp Cln 
115 120 125 

Leu He Cln Thr Leu His Ala Val Asp Lys Asp Asp Pro Tyr Ser Gly 
130 - 135 140 

Bis Cln Phe Ser Phe Ser Leu Ala Pro Clu Ala Ala Ser Gly Ser Asn 
145 150 155 160 

Phe Thr He Gin Asp Asn Lys Asp Asn Thr Ala Cly He Leu Thr Arg 
165 170 175 

Lys Asn Cly Tyr Asn Arg His Glu Met Ser Thr Tyr Leu Leu Pro Val 
J 180 185 . 190 

Val He Ser Asp Asn Asp Tyr Pro Val Gin Ser Ser Thr Cly Thr Val 
195 200 205 

Thr Val Arg Val Cys Ala Cys Asp His His Cly Asn Met Cln Ser Cys 
210 215 220 

His Ala Cltt Ala Leu He His Pro ?hr Cly Leu Ser Thr Cly .Ala Leu 
225 230- 235 240 

Val Ala He Leu Leu Cys He Val He Leu Leu Val Thr Val Val Leu 
245 250 255 

Phe Ala Ala Leu Arg Arg Cln Arg Lys Lys Clu Pro Leu He He Ser 
260 265 270 
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Lys Clu Asp He Arg Asp Asn He Val Ser Tyr Asn Asp Clu Cly Gly 
275 280 285 

Gly Glu Glu Asp Thr Gin Ala Phe Asp lie Gly Thr Leu Arg Asn Pro 
290 295 300 

Glu Ala He Glu Asp Asn Lys Leu Arg Arg Asp He Val Pro Clu Ala 
305 310 315 320 

Leu Phe Leu Pro Arg Arg Thr Pro Thr Ala Arg Asp Asn Thr Asp Val 
325 330 335 

Arg Asp Phe He Asn Gin Arg Leu Lys Glu Asn Asp Thr Asp Pro Thr 
340 345 350 

Ala Pro Pro Tyr Asp Ser Leu Ala Thr Tyr Ala Tyr Glu Gly Thr Gly 
355 360 365 

Ser Val Ala Asp Ser Leu Ser Ser Leu Glu Ser Val Thr Thr Asp Ala 
370 375 380 

Asp Gin Asp Tyr Asp Tyr Leu Ser Asp Trp Gly Pro Arg Phe Lys Lye 
385 390 395 400 

Leu Ala Asp Met Tyr Gly Gly Val Asp Ser Asp Lys Asp Ser 
405 410 

(2) INFORMATION FOR SEQ ID NO:53x 

(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTHS 2550 base pairs 

(B) TYPEt nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY t linear 



10 



15 



20 



30 



40 



45 



50 



(ii) MOLECULE TYPE* cDNA 



(Xi) SEQUENCE DESCRIPTION x SEQ ID NO: 53: 

CAGGAAATGC TCTTGGATCT CTGCACTCCA TTAATAATAT TATCCATTAC TCTTCCCCCT 60 

35 TGCATTTACA TCGCTCOCAT GAATCAGTCT CAA CTTTT AA TGAGTGGATC CCCTTTGGAA 120 

CTAAACAGTC TGGGTGAAGA ACAGCGAATT TTGAACCGCT CCAAAAGAGG CTGGGTTTGG 180 

AATCAAATGT TTGTCCTGGA AGAGTTTTCT GGACCTGAAC CGATTCTTGT TGGCCGGCTA 240 

CACACAGACC TCGATCCTGC GAGCAAAAAA ATCAAGTATA TCCTATCAGC TGATGGAGCT 300 

GGGACCATAT TTCAAATAAA TGATCTAACT GGAGATATCC ATCCTATAAA AAGACTTGAC 360 

OGGGAGGAAA AGG CTG ACTA TACCCTAACA GCTCAAGCAG TGGACTGGGA GACAAGCAAA 420 

CCTCTCGAGC CTCCTTCTGA ATTTATTATT AAACTTCAAC ACATCAATGA CAATGCACCA 480 

GAGTTTCTTA ATGGACCCTA TCATCCTACT GTGCCAGAAA TGTCCATTTT GGGTACATCT 540 

CTCACTAACC TCACTGCGAC CGACGCTCAT GACCCAGTTT ATGGAAACAG TCCAAAGTTC 600 

CTTTATACTA TATTGGAAGG GCAG CCTTAT TTTTCCATTC AGCCTGAAAC AGCTATTATA 660 

AAAACTGCCC TTCCCAACAT GGAGAGAGAA GCCaAGGAGG ACTACCTCGT TCTTATCCAA 720 

GCCAAAGATA TGGGTGGACA CTCTGGTGGC CTCTCTGGGA CCACGACACT TACAGTGACT 780 
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CTTACTGATG TTAATGACAA TCCTCCAAAA TTTGCACACA CCCTCTATCA CTTCTCACTA 

CCGGAACATC TCCTTCTTCC CACTCCAATA GGAXGGGTGA ACCCCAATCA TCACGATATT 

CCTCAAAATC CACACTCATC ATATCATATC ATCGATGGAG ATGGAACAGC ACTTTTTGAA 

ATCACTTCTC ATCCCCACGC CCACCATGGC ATTATAAGCC TAACAAAACC TCTCGACTTT 

GAGACCAAAA AATCCTATAC GCTAAACGAT GACGCAGCCA ATGTCCATAT TGACCCACGC 

TTCAGTGCCA GCGGGCCCTT TAAAGACACC GCGACAGTCA AAATOGTCCT TCAAGATGCT 

GATCAGCCTC CGGTCTTCTC TTCACCGACT TACCTACTTG AAGTTCATGA AAATGCTGCT 

CTAAACTCCG TCATTGGGCA ACTCACTGCT CGTCACCCTG ATATCACTTC CAGTCCTATA 

AGGTTTTCCA TCGACCGGCA CACTGACCTG GAGACGCAGT TCAACATTAA TGCAGACGAT 

GGGAAGATAA CGCTGGCAAC ACCACTTGAC AGAGAATTAA GTGTATGGCA CAACATAACA 

ATCATTCCTA CTGAAATTAG CAACCACAGT CAGATATCAC GAGTACCTGT TCCTATTAAA 

CTGCTGGATG TCAATGACAA CGCCCCTGAA TTCGCATCCG AATATCAGCC ATTTTTATGT 

GAAAATGGAA AACCCGGCCA ACTCATTCAA ACTCTTAGCG CCATGGACAA AGATGATCCC 

AAAAACGGAC ATTATTTCTT ATACAGTCTC CTTCCAGAAA TGCTCAACAA TCCGAATTTC 

ACCATCAAGA AAAATGAAGA TAATTCCCTC AGTATTTTGG CAAAGCATAA TGGATTCAAC 

CGCCAGAAGC AAGAAGTCTA TCTTTTACCA ATCATAATCA GTGATAGTCG AAATCCTCCA 

CTQAGCAGCA CTAGCACCTT GACAATCAGC CTCTCTGGCT CCAGCAATGA OGCTGTCGTC 

CAGTCTTGCA ATCTCGAAGC TTATGTCCTT CCAATTGGAC TCAGTATGGG "CGCCTTAATT 

GCCATATTAG CATGCATCAT TTTGCTGTTA GTCATOGTGG TCCTGTTTCT AACTCTACGC 

CGGCATCAAA AAAATGAACC ATTAATTATC AAAGATGATG AAGACCTTCG AGAAAACATC 

ATTCGCTACG ATGATCAAGC AGCAGGGGAG GAGGACACAG AGCCTTTTGA CATTGCAACT 

TTACAAAATC CAGATGGAAT TAATGCATTT TTAOCCOGTA AGGATATTAA ACCAGATTTG 

CAGTTTATGC CAAGGCAAGG GCTTGCTCCA GTTCCAAATG GTGTTGATGT CGATGAATTT 

ATAAATGTAA GGCTGCATGA GGCAGATAAT GATCCCACAG CCCCGCCATA TGACTCCATT 

CAAATATATG GCTATGAAGG CCGACCGTCA GTGCCTGCCT CCCTCACCTC CTTCGAGTCC 

ACCACATCAG ACTCAGACCA CAATTTT G AC TACCTCAGTG ACTGGGGTCC CCGCTTTAAG 

AGACTGGGCG AACTCTACTC TCTTCCTOAA AGTCAGAAAG AAACTTGACA GTGGATTATA 

AATAAATCAC TGCAACTGAG CATTCTGTAA TATTCTAGGC TCACTCCCCT TAGATACAAC 

CAATCTCGCT ATTTGTTTAG AGGCAAGTTT AGCACCAGTC ATCTATAACT CAACCACATT 

TA ATCT TGAC AAAAAGATAA TAAATAAAAA 

(2) INFORMATION Jt)R SEQ ID HOi54* 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 793 amino acids 
(B> TYPEs amino acid 
(D) TOPOLOGY: linear 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2S50 
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(ii) MOLECULE TYPEr protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Leu Leu Asp Leu Trp Thr Pro Leu lie lie Leu Trp lie Thr Leu 
15 10 is 

Pro Pro Cys He Tyr Met Ala Pro Met Asn Gin Ser Gin Val Leu Met 
20 25 30 

Ser Gly Ser Pro Leu Gin Leu Asn Ser Leu Gly Glu Glu Gin Arg He 
35 40 45 

Leu Asn Arg Ser Lys Arg Gly Trp Val Trp Aan Gin Met Phe Val Leu 
50 55 60 

Glu Glu Phe Ser Gly Pro Glu Pro He Leu Val Gly Arg Leu His Thr 
65 70 75 80 

Asp Leu Asp Pro Gly Ser Lys Lys He Lys Tyr He Leu Ser Gly Asp 
85 90 95 

Gly Ala Gly Thr He Phe Gin He Asn Asp Val Thr Gly Asp He His 
100 105 HO 

Ala He Lys Arg Leu Asp Arg Glu Glu Lys Ala Glu Tyr Thr Leu Thr 
115 120 125 

Ala Gin Ala Val Asp Trp Glu Thr Ser Lys pro Leu Glu Pro Pro Ser 
130 135 140 

Glu Phe He He Lys Val Gin Asp He Asn Asp Asn Ala pro Glu Phe 
145 150 1S5 160 

Leu Asn Gly Pro Tyr His Ala Thr Val Pro Glu Met Ser He Leu Gly 
165 170 175* 

Thr Ser Val Thr Asn Val Thr Ala Thr Asp Ala Asp Asp Pro Val Tyr 
180 185 190 

Gly Asn Ser Ala Lys Leu Val Tyr Ser He Leu Glu Gly Gin Pro Tyr 
195 200 205 

Phe Ser He Glu Pro Glu Thr Ala He He Lys Thr Ala Leu Pro Asn 
210 215 220 

Met Asp Arg Glu Ala Lye Glu Glu Tyr Leu Val Val He Gin Ala Lys 
225 230 235 240 

Asp Met Gly Gly His Ser Gly Gly Leu Ser Gly Thr Thr Thr Leu Thr 
245 250 255 

Val Thr Leu Thr Asp Val Asn Asp Asn Pro Pro Lys Phe Ala Gin Ser 
260 265 270 

Leu Tyr His Phe Ser Val Pro Glu Asp Val Val Leu Gly Thr Ala He 
275 280 285 

Gly Arg Val Lys Ala Asn Asp Gin Asp He Gly Glu Asn Ala Gin Ser 

290 * " 295 300 

Ser Tyr Asp He He Asp Gly Asp Gly Thr Ala Leu Phe Glu He Thr 
305 310 315 320 
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Ser Asp Ma Gin Ala Cln Asp Gly lie lie Arg Leu Arg Lys Pro Leu 
325 330 33b 

Asp Phe Glu Thr Lys Lye Ser Tyr Thr Leu Lye Asp Glu Ala Ala Asa 
340 345 350 

Val His He Asp Pro Arg Phe Ser Gly Arg Gly Pro Phe Lys Aep Thr 
355 360 365 

Ala Thr Val Lye He Val Val Glu Asp Ala Asp Glu Pro Pro Val Phe 
370 375 380 

ser Ser Pro Thr Tyr Leu Leu Glu Val Hie Glu Aen Ala Ala Leu Aen 
385 390 395 400 

ser Val He Gly Cln Val Thr Ala Arg Aep Pro Aep He Thr Ser Ser 
405 410 415 

Pro He Arg Phe Ser He Asp Arg Hie Thr Aep Leu Glu Arg Gin Phe 
420 425 430 

Aen He Aen Ala Aep Aep Gly Lye He Thr Leu Ala Thr Pro Leu Aep 
435 440 < 445 

Ara Glu Leu Ser Val Trp Hie Aen He Thr He He Ala Thr Glu He 
450 455 460 

Arg Aen Hie Ser Cln He Ser Arg Val Pro Val Ala He Lye Val Leu 
465 470 475 «80 

Aep Val Aen Aep Aen Ala Pro Glu Phe Ala Ser Glu Tyr Glu Ala Phe 
485 490 495 

Leu Cye Glu Aen Gly Lye Pro Gly Cln Val He Gin Thr Val Ser Ala 
500 505 510 

Met Aep Lye Aep Aep Pro Lye Aen Gly His Tyr Phe Leu Tyr Ser Leu 
515 520 525 

Leu Pro Glu Met Val Aen Aen Pro Aen Phe Thr He Lye Lye Aen Glu 
530 535 540 

Aep Aen Ser Leu Ser He Leu Ala Lye Hie Aen Gly Phe Aen Arg Gin 

545 550 555 => t »° 

Lye Gin Glu Val Tyr Leu Leu Pro He He He Ser Aep Ser Gly Aen 
* 565 570 575 

Pro Pro Leu Ser Ser Thr Ser Thr Leu Thr He Arg Val Cye Gly Cye 

580 585 590 

Ser Aen Aep Gly Val val Cln ser Cye Aen Val clu £la Tyr Val Leu 
595 600 605 

Pro He Gly Leu Ser Ket Gly Ala Leu He Ala He Leu Ala Cye He 
610 615 «0 

He Leu Leu Leu Val He Val Val Leu Phe Val Thr Leu Arg Arg Hie 
625 630 635 640 

Cln Lye Aen Glu Pro Leu He He Lye Aep Aep Clu Aep Val Arg Clu 
645 650 655 

Aen He He Arg Tyr Asp Aep Clu Gly Cly Gly Glu Glu Aep Thr Glu 
660 665 670 
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Ala Phe Asp He Ala Thr Leu Gin Asn Pro Asp Gly He Asn Gly Phe 
675 680 685 

Leu Pro Arg Lys Asp He Lys Pro Asp Leu Gin Phe Met Pro Arg Gin 
690 695 700 

Gly Leu Ala Pro Val Pro Asn Gly Val Asp Val Asp Glu Phe tie Asn 
705 710 715 720 

Val Arg Leu Hie Glu Ala Asp Asn Asp Pro Thr Ala Pro Pro Tyr Asp 
725 730 735 

Ser He Gin He Tyr Gly Tyr Glu Gly Arg Gly Ser Val Ala Gly Ser 
740 745 750 

Leu Ser Ser Leu Glu Ser Thr Thr Ser Asp Ser Asp Gin Asn Phe Asp 
755 760 765 

Tyr Leu Ser Asp Trp Gly Pro Arg Phe Lys Arg Leu Gly Glu Leu Tyr 
770 775 780 

Ser Val Gly Glu Ser Asp Lys Glu Thr 
785 790 

(2) INFORMATION FOR SEQ ID NO: 55: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKDNESS : single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDKA 



(tx) FEATURES 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 730 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5Sx 

G AAT TCG AGC TCG GTA CCC GCC GAT CCT CTA GAG TOG ACC TGC ACT 
Asn Ser Ser Ser Val Pro Gly Asp' Pro Leu Glu Ser Thr Cys Ser 
1 5 10 15 

GCT GAA GCC CTG CTC CTC CCT GCC CCC CTC AGC ACT GGG GCC TTG ATC 
Ala Glu Ala Leu Leu Leu Pro Ala Gly Leu Ser Thr Gly Ala Leu He 
20 25 30 

CCC ATC CTC CTC TGC ATC ATC ATT CTA CTG GTT ATA GTA CTA CTC TTT 
Ala He Leu Leu Cys He He He Leu Leu Val He Val Val Leu Phe 
35 40 45 

CCA CCT CTC AAA AGA GAG CCA AAA AAA GAG CCT CTG ATC TIC TCA AAA 
Ala Ala Leu Lys Arg Cln Arg Lys Lys Glu Pro Leu He Leu Ser Lys 
50 55 60 

GAA GAT ATC AGA GAC AAC ATT CTG ACC TAT AAC GAT GAG CCT GGT GGA 
Glu Asp He Arg Asp Asn He Val Ser Tyr Asn Asp Glu Gly Gly Gly 
65 70 75 

GAG GAG GAC ACC CAG GCC TTT GAT ATC GGC ACC CTG AGG AAT CCT CCA 
Glu Glu Asp Thr Gin Ala Phe Asp He Gly Thr Leu Arg Asn Pro Ala 
80 85 90 95 
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GCC ATT GAG GAA AAA AAG CTC CGC CGA GAT ATT ATT CCA GAA ACG TTA 334 
Ala lie Glu Glu Lyo Lye Leu Arg Arg Aep lie lie Pro Glu Thr Leu 
100 105 110 

TTT ATT CCT CGG AGG ACT CCT ACA GCT CCA GAT AAC ACG GAC GTC CGG 382 
Phe lie Pro Arg Arg Thr Pro Thr Ala Pro Asp Asn Thr Asp Val Ar£ 
115 120 125 

GAT TTC ATT AAT GAA AGG CTA AAA GAG CAT CAT CTT GAC CCC ACC GCA 430 
Asp Phe He Aen Glu Arg Leu Lye Glu Hie Asp Leu Asp Pro Thr Ala 
130 135 140 

CCC CCC TAC GAC TCA CTT CCA ACC TAT GCC TAT GAA CCA AAT GAT TCC 478 
Pro Pro Tyr Aep Ser Leu Ala Thr Tyr Ala Tyr Glu Gly Asn Aep Ser 
145 150 155 

ATT CCT GAA TCT CTC ACT TCA TTA GAA TCA GCT ACT ACT GAA GGA GAC 526 
He Ala Glu Ser Leu Ser Ser Leu Glu Ser Gly Thr Thr Glu Gly Asp 
160 165 170 175 

CAA AAC TAC GAT TAC CTC CCA GAA TGG GCC CCT CGG TTT AAT AAG CTA 574 
Gin Aen Tyr Asp Tyr Leu Arg Glu Trp Gly Pro Arg Phe Asn Lys Leu 

180 ... 18S 190 

GCA GAA ATG TAT GCT GGT GGG GAA AGT GAC AAA GAC TCT TAA CGT AGG 622 
Ala Glu Met Tyr Gly Gly Gly Glu Ser Asp Lys Asp Ser * Arg Arg 
195 200 205 

ATA TAT GTT CTC TTC AAA CAA GAG AAA GTA ACT CTA CCC ATG CTG TCT 670 
He Tyr Val Leu Phe Lys Gin Glu Lye Val Thr Leu Pre Met Leu Ser 
210 215 220 

CCA CTT CAC AAT ATT TCA TAT TCA CGA GCA TTT CCT GCA GTC AGC ACA 718 
Pro Leu Bis Asn He * Tyr Ser Gly Ala Phe Pro Ala Val Ser Thr 
225 230 235 



30 ATT TTT TTC TCA 

He Phe Phe Ser 
240 

(2) INFORMATION FOR SEQ ID WOx56: 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 241 amino acids 

(B) TYPE* amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPES protein 

40 (xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 56: 

Asn Ser Ser Ser Val Pro Gly Asp Pro Leu Glu Ser Thr Cye Ser Ala 
1 5 10 15 



Glu Ala Leu Leu Leu Pro Ala Gly Leu Ser Thr Gly Ala Leu He Ala 
20 25 30 

He Leu Leu Cys He He He Leu Leu Val He Val Val Leu Phe Ala 
35 40 45 

Ala Leu Lys Arg Gin Arg Lys Lye Glu Pro Leu He Leu Ser Lys Glu 

50 " 55 60 

Asp He Arg Asp Asn He Val Ser Tyr Asn Asp Glu Gly Gly Gly Glu 
65 70 7S 80 
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Glu Asp Thr Gin Ala Phe Asp lie Gly Thr Leu Arg Asn Pro Ala Ala 

85 90 95 

lie Glu Glu Lys Lys Leu Arg Arg Asp lie lie Pro Glu Thr Leu Phe 
100 105 110 

lie Pro Arg Arg Thr Pro Thr Ala Pro Asp Asn Thr Asp Val Arg Asp 
115 120 125 

Phe He Asn Glu Arg Leu Lys Glu His Asp Leu Asp Pro Thr Ala Pro 
130 135 140 

Pro Tyr Asp Ser Leu Ala Thr Tyr Ala Tyr Glu Gly Asn Asp Ser He 
145 150 155 160 

Ala Glu Ser Leu Ser Ser Leu Glu Ser Gly Thr Thr Glu Gly Asp Gin 
165 170 175 

Asn Tyr Asp Tyr Leu Arg Glu Trp Gly Pro Arg Phe Asn Lys Leu Ala 
160 165 190 

Glu Met Tyr Gly Gly Gly Glu Ser Asp Lys Asp Ser Arg Arg lie Tyr 
195 200 205 

Val Leu Phe Lys Gin Glu Lya Val Thr Leu Pro Met Leu ser Pro Leu 
210 215 220 

His Asn He Tyr Ser Gly Ala Phe Pro Ala Val Ser Thr He Phe Phe 
225 230 235 240 

25 *er 

<2) INFORMATION FOR SEQ ID NOt57t 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2625 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRAND ED NESS t single 
<D> TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 

35 

(xl) SEQUENCE DESCRIPTION : SEQ ID NO: 57: 

CGCCACCCCT GACGTGATGA CCTCAACCAG CAGAGACATT CCATCCCAAG AGAGGTCTGC 60 

40 GTCACGCCTC CGGCAGGCCA CCCTCAGCAA GACCAC0GTA CACTTCCTGC AAGGGGTGAC 120 

AOCTGCATTC TCCTCTGCCT ACCACCTAAC CAAAAATCAA GGAGAACTAC TGTTTACAAG 180 

COCCCCTCCT CT GC CT C GCC ATGCTCTGCC ACAGCCATGC CTTTGCCCCA GAGCGGCGGG 240 

GGCACCTGCG GCCCTCCTTC CATGGGCACC ATGAGAAGCC CAAGCAGGGG CAGGTGCTAC 300 

AGCCCTCCAA GCGTCCCTGG CTCTGGAACC AGTTCTTCCT GATAGAGGAG TACACCGGGC 360 

CTCACCCCGT GCTTGTGGGQ AGGCTTCATT CAGATATTGA CTCT C GTGAT CGGAACATTA 420 

AATACATTCT CTCAGGGGAA GGAGCTGGAA CCATTTTTGT GATTGATGAC AAATCACGGA 480 

ACATTCATGC CACCAAGAOG TTGGATOGAG AAGASAG AGC CCAGTACACG TTGATGGCTC 540 

AGGCGCTGCA CAGGGACACC AATOGGCCAC TCGAGCCACC GTCGGAATTC ATTCTCAAGC 600 
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TCCAGCACAT TAATGACAAC CCTCCCGACT TCCTCCACCA CACCTATCAT CCCAACCTCC 
CTCAGACGTC CAATGTGGGA ACCTCAGTAA TCCAGCTCAC ACCTTCACAT CCAGATCACC 
CCACTTATGC AAATACCCCC AAGTTAGTGT ACAGTATCCT OCAAGGACAA CCCTATTTTT 
CGGTGGAAGC ACAGACAGCT ATCATCAGAA CAGCOCTAOC CAACATGGAC AGGGAGGCCA 
AGCAGGAGTA CCACGTGGTG ATCCACGCCA AGGAGATCGG TGGACATATG GGCGGACTCT 
CAGGCACAAC CAAAGTGACC ATCACACTGA COGATGTCAA TCACAACCCA CCAAAGTTTC 
OGCAGAGCCT ATACCAGATG TCTGTGTCAG AAGCAGCCGT CCCTGGGCAG GAAGTAGGAA 
CAGTGAAAGC TAAAGATCCA CACATTCGAG AAAATGCCTT AGTCACATAC AATATTCTTC 
ATGGAGATGG TATGGAATCG TTTGAAATCA CAACGGACTA TGAAACACAG CAGGGGCTGA 
TAAAGCTGAA AAAGCCTGTA GATTTTCAAA CCGAAAGAGC CTATACCTTG AAGGTAGAGG 
CAGCCAACCT GCACATOGAC COGAAGTTTA TCAGCAATGG CCCTTTCAAC GACACTGTGA 
CCGTCAAGAT CTCAGTAGAA GATGCTGATG AGCCCCCTAT GTTCTTGGCC CCAAGTTACA 
TCCAOGAACT CCAAGAAAAT GCAGCTGCTG GCACOGTGGT TGGGAGAGTG CATGCCAAAG 
ACCCTGATGC TGCCAACAGC CCGATAAGGT ATTCCATCGA TCGTCACACT GACCTCGACA 
GATTTTTCAC TATTAATCCA GAGGATGGTT TTATTAAAAC TACAAAACCT CTGGATAGAG 
AGGAAACAGC CTGGCTCAAC ATCACTGTCT TTGCAGCAGA AATCCACAAT CGGCATCAGG 
AAGCCCAAGT CCCAGTGGCC ATTAGGGTCC TTGATCTCAA CGATAATGCT CCCAAGTTTG 
CTGCCCCTTA TGAAGGTTTC ATCTCTGACA GTGATCAGAC CAAGCCACTT TCCAACCAGC 
CAATTGTTAC AATTAGTGCA GATGACAAGG ATGACACGGC CAATCGACCA AGATTTATCT 
TCAGCCTACC CCCTGAAATC ATTCACAATC CAAATTTCAC AGTCAGAGAC AACOGAGATA 
ACACAGCAGG CGTGTACGCC OGGOGTGGAG GGTTCAGTCG GCAGAAGCAC GACTTGTACC 
TTCTGCCCAT AGTGATCAGC CATCCCGGCA TCCOGCCCAT GAGTAGGACC AACACCCTCA 
CCATCAAAGT CTGOGGGTGC GAOGTGAACG GGGCACXCCT CTCCTGCAAC GCAGAGCCCT 
ACATTCTGAA OGCOGGCCTG AGCACAGGCG CCCTGATCCC CATCCTOCCC TGCATCGTCA 
TTCTOCTGGT CATTCTAGTA TTCTTTGTGA CCCTGAGAAG GCAAAAGAAA CAACCACTCA 
TTGTCTTTGA GGAAGAAGAT GTCOGTGAGA ACATCATTAC TTATGATGAT GAAGCGGGTC 
GGGAAGAAGA CACAGAAGCC TTTGATATTG CCACCCTCCA GAATCCTGAT CGTATCAATG 
GATTTATCCC CCGCAAAGAC ATCAAACCTC AGTATCAGTA CATGCCTAGA CCTGGCCTCC 
GGOCAGCGCC CAACAGOGTG GATGTCGATG ACTTCATCAA CACGAGAATA CAGGAGGCAG 
ACAATGACCC CACGGCTCCT CCTTATGACT CCATTCAAAT CTACCGTTAT GAAGGCAGGG 
CCTCAGTGGC CGGGTCOCTG AGCTCCCTAG AGTOGGOCAC CACAGATTCA GACTTGCACT 
ATGATTATCT ACACAACTCG CGACCTCGTT TTAAGAAACT AGCAGATTTG TATCGTTCCA 
AAGACACTTT TCATGAOGAT TCTTAACAAT AACCATACAA ATTTGCCCTT AAGAACTGTG 
TCTCGOGTTC TCAAGAATCT AGAAGATGTG TAACAGCTAT TTTTT 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 amino acids 
(B j TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:S8: 

Met Lye Glu Asn Tyr Cys Leu Gin Ala Ala Leu Val Cys Leu Gly Met 
1 5 10 15 

Leu Cys Hie Ser His Ala Phe Ala Pro Glu Arg Arg Cly His Leu Arg 
20 25 30 

Pro Ser Phe His Gly His His Glu Lys Gly Lys Glu Gly Gin Val Leu 
35 40 45 

Gin Arg Ser Lys Arg Gly Trp Val Trp Asn Gin Phe Phe Val lie Glu 
50 55 60 

Glu Tyr Thr Gly Pro Asp Pro Val Leu Val Gly Arg Leu His Ser Asp 
65 70 75 60 

lie Asp Ser Gly Asp Gly Asn lie Lys Tyr lie Leu Ser Gly Glu Gly 
65 90 95 

Ala Gly Thr He Phe Val He Asp Asp Lys Ser Gly Asn He His Ala 
25 100 105 110 

Thr Lys Thr Leu Asp Arg Glu Glu Arg Ala Gin Tyr Thr Leu Met Ala 
115 120 125 



w 



15 



20 



30 



35 



Gin Ala Val Asp Arg Asp Thr Asn Arg Pro Leu Glu Pro Pro Ser Glu 
130 135 140 

Phe He Val Lye Val Gin Asp He Asn Asp Asn Pro Pro Glu Phe Leu 
145 150 1SS 160 

His Glu Thr Tyr His Ala Asn Val Pro Glu Arg Ser Asn Val Gly Thr 
165 170 175 

Ser Val He Gin Val Thr Ala Ser Asp Ala Asp Asp Pro Thr Tyr Gly 
180 185 190 

Asn Ser Ala Lys Leu Val Tyr Ser He Leu Glu Gly Gin Pro Tyr Phe 
195 200 205 

ao ser Val Glu Ala Gin Thr Gly He He Arg Thr Ala Leu Pro Asn Met 

210 215 220 

Asp Arg Glu Ala Lys Glu Glu Tyr His Val Val He Gin Ala Lys Asp 
225 230 235 240 



45 



Met Gly Gly His Met Gly Gly Leu Ser Gly Thr Thr Lys Val Thr He 
245 250 255 

Thr Leu Thr Asp Vai Asn Asp Asn Pro Pro Lys Phe Pro Gin Arg Leu 
260 : 265 270 

Tyr Gin Met Ser Val Ser Glu Ala Ala Val Pro Gly Glu Glu Val Gly 
50 275 280 285 

Arg Val Lys Ala Lys Asp Pro Asp He Gly Glu Asn Gly Leu Val Thr 
290 295 300 
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Tyr Asn He Val Asp Gly Asp Gly Ket Clu Ser Phe Glu He Thr Thr 
305 310 315 320 

Asp Tyr Glu Thr Gin Clu Gly val He Lye Leu Lye Lye Pro Val Asp 
325 330 335 

Phe Glu Thr Glu Arg Ala Tyr Ser Leu Lys Val Glu Ala Ala Asn Val 
340 345 350 

His He Aep Pro Lys Phe He Ser Asn Gly Pro Phe Lys Asp Thr Val 
355 360 365 

Thr Val Lys He Ser Val Glu Asp Ala Asp Glu Pro Pro Met Phe Leu 
370 375 380 

Ala Pro ser Tyr He His Glu Val Gin Glu Asn Ala Ala Ala Gly Thr 
385 390 395 400 

Val Val Gly Arg Val His Ala Lys Asp Pro Asp Ala Ala Asn Ser Pro 
405 410 415 

He Arg Tyr ser He Asp Arg His Thr Asp Leu Asp Arg Phe Phe Thr 
420 425 430 

He Asn Pro Glu Asp Gly Phe He Lys Thr Thr Lys Pro Leu Asp Arg 
435 440 445 

Glu Clu Thr Ala Trp Leu Asn He Thr Val Phe Ala Ala Glu He His 
450 455 460 

Asn Arg His Gin Glu Ala Gin Val Pro Val Ala He Arg Val Leu Asp 
465 470 475 480 

Val Asn Asp Asn Ala Pro Lys Phe Ala Ala Pro Tyr Glu Gly Phe He 
485 490 49S 

Cys Glu Ser Aep Cln Thr Lys Pro Leu Ser Asn Gin Pro He Val Thr 
1 500 505 510 

He Ser Ala Asp Asp Lys Asp Asp Thr Ala Asn Cly Pro Arg Phe He 
515 520 525 

35 phe Ser Leu Pro Pro Glu lie He His Asn Pro Asn Phe Thr Val Arg 

530 535 540 

Asp Asn Arg Asp Asn Thr Ala Cly Val Tyr Ala Arg Arg Gly Cly Phe 
545 550 555 560 

Ser Arg Cln Lys Cln Asp Leu Tyr Leu Leu Pro He Val He Ser Asp 

40 565 570 575 

Cly Cly He Pro Pro Met Ser Ser Thr Asn Thr Leu Thr He Lys Val 
580 585 590 

Cys Cly Cys Asp Val Asn Cly Ala Leu Leu ser Cys Asn Ala Clu Ala 

45 595 600 60S 

Tyr He Leu Asn Ala Gly Leu Ser Thr Gly Ala Leu He Ala He Leu 
610 615 620 
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Ala Cys He Val He Leu Leu Val lie Val Val Leu Phe Val Thr Leu 
625 63b 635 640 

Arg Arg Cln Lys Lys Clu Pro Leu He Val Phe Glu Glu Clu Asp Val 
645 650 655 
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Arg Glu Asn He He Thr Tyr Asp Asp Glu Gly Gly Gly Glu Glu Aap 
660 665 670 

Thr Glu Ala Phe Asp He Ala Thr Leu Gin Asn Pro Asp Gly He Asn 
5 675 680 685 

Gly Phe He Pro Arg Lys Asp He Lys Pro Glu Tyr Gin Tyr Met Pro 
690 695 700 

Arg Pro Gly Leu Arg Pro Ala Pro Asn Ser Val Asp Val Asp Asp Phe 

10 70S 710 715 720 

He Asn Thr Arg He Gin Glu Ala Asp Asn Asp Pro Thr Ala Pro Pro 

725 730 735 

Tyr Asp Ser He Gin He Tyr Gly Tyr Glu Cly Arg Gly Ser Val Ala 
J5 740 745 750 

Gly ser Leu Ser Ser Leu Glu Ser Ala Thr Thr Asp Ser Asp Leu Asp 
755 760 765 

Tyr Asp Tyr Leu Gin Asn Trp Gly Pro Arg Phe Lys Lys Leu Ala Asp 
770 775 780 

20 

Leu Tyr Gly Ser Lys Asp Thr Phe Aap Asp Asp Ser 
785 790 795 

(2) INFORMATION FOR SEQ ID NO:59i 

(1) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTHS 2521 base pairs 

(B) TYPEs nucleic acid 

(C) STRANDEDNESSx single 

(D) TOPOLOGY s linear 

(11) MOLECULE TYPE: cDNA 

30 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 59 : 





CGCTGGAGGC 


CACAGACACC TCAAACCTGG ATTCCACAAT TCTACGTTAA GTCTTGGAGT 


60 


35 


TTTTATTACT 


CTGCTCTAGG AAAGCCTTTG CCAATGCTTA CAAGGAACTG TTTATCCCTG 


120 




CTTCTCTGGC 


TTCTGTTTCA TCGAGCTCTC CTAACACCAC TACAACCACA GCCACAGCAG 


180 




ACTTTAGCCA 


CAGAGCCAAG AGAAAATCTT ATCCATCTGC CAGGACAACG GTCACATTTC 


240 


40 


CAACGTGTTA 




300 






AGTATCTCCC AAAGCTCCAT TCCGACTTAG ACAAGGGAGA GGGCACTCTC 


360 




AAATACACCC 


TCTCAGCAGA TGGOGCTGGC ACCGTTTTTA CCATTGATCA AACCACAGCG 


420 


45 


CACATTCATG 


CAATAAGGAG CCTAGATAGA GAAGAGAAAC CTTTCTACAC TCTTCCTCCT 


480 




CAGGCTGTGG 


ACATAGAAAC CACAAAGCCC CTGGAGCCTG AATCAGAATT CATCATCAAA 


540 




GTCCACCATA 


TTAATGATAA TGACCCAAAG TTTTTGGATC GACCTTATGT TGCTACTGTT 


600 


50 


CCAGAAATGT 


CTCCTCTCGG TCCATATCTA CTCCAGGTCA ACGCCACAGA TGCAGATGAC 


660 


CCGACCTATG 


GAAACAGTCC CAGAGTCGTT TACAGCATTC TTCAGGGACA ACCTTATTTC 


720 




TCTATTCATC 


CCAAGACAGG TCTTATTAGA ACAGCTTTGC CAAACATGGA CAGAGAAGTC 


780 
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AAAGAACAAT ATCAAGTACT CATCCAAGCC AAGGATATGG GAGGACAGCT TGCACCATTA 840 

GCCGGAACAA CAATAGTCAA CATCACTCTC ACCCATCTCA ATGACAATCC ACCTCGATTC 900 

CCCAAAAGCA TCTTCCACTT GAAAGTTCCT GAGTCTTCCC CTATTGGTTC AG CTATTGG A 960 

AGAATAAGAG CTCTGGATCC TGATTTTGGA CAAAATCCAG AAATTGAATA CAATATTGTT 1020 

CCAGCAGATG GGGGAAATTT GTTTGACATC GTCACAGATG AGGATACACA AGAGGGAGTC 1080 

ATCAAATTGA AAAACCCTTT AGATTTTCAA ACAAAGAAGG CATACACTTT CAAAGTTGAG 1140 

GCTTCCAACC TTCACCTTGA CCACCGGTTT CACTCGGCGG GCCCTTTCAA AGACACAGCT 1200 

ACGGTCAAGA TCAGCGTGCT GGACGTAGAT GAGCCACCGG TTTTCAGCAA GCCGCTCTAC 1260 

ACCATGGAGG TTTATGAAGA CACTCCGGTA CGCACCATCA TTGCCCCTGT CACTCCTCAA 1320 

GACCTGGATG TACGCAGCGG TGCTGTTAGG TACTTCATAG ATTGGAAGAC TGATGGGGAC 1380 

AGCTACTTTA CAATAGATGG AAATCAAGGA ACCATCGCCA CTAATGAATT ACTAGACAGA 1440 

GAAAGCACTG OGCACTATAA TTTCTCCATA ATTGOGAGTA AACTTAGTAA CCCTTTATTG 1500 

ACCAGCAAAG TCAATATACT CATTAATCTC TTAGATGTAA ATCAATTTCC TCCAGAAATA 1560 

TCTGTGCCAT ATGAGACAGC CCTCTGTGAA AATGCCAAGC CAGGACAGAT AATTCAGATA 1620 

GTCAGTGCTC CAGACCGAGA TCTTTCACCT CCTGGCCAAC AATTCTCCTT TAGATTATCA 1680 

CCTGAGGCTG CTATCAAACC AAATTTTACA GTTOGTGACT TCAGAAACAA CACAGCGGGG 1740 

ATTGAAACCC GAAGAAATGG ATACAGCCGC AGGCAGCAAG AGTTCTATTT CCTCCCTGTT 1800 

GTAATAGAAG ACAGCAGCTA CCCTCTOCAG AGCAGCACAA ACACAATCAC TATTCGAGTC 1860 

TGTAGATGTG ACTCTGATCG CACCATCCTG TCTTCTAATC TGGAAGCAAT TTTTCTACCT 1920 

GTAGGACTTA GCACTGGGGC GTTGATTGCA ATTCTACTAX GCATTGTTAT ACTCTTAGCC 1980 

ATAGTTGTAC TGTATGTAGC ACTGCGAAGG CAGAAGAAAA AGCACACCCT GATGACCTCT 2040 

AAAGAAGACA TCAGAGACAA CGTCATCCAT TAOGATGATG AAGCAGGTGC GGAGGAAGAT 2100 

ACCCAGGCTT TCGACATOGG GGCTCTGACA AACCCAAAAG TGATTGAGGA CAACAAAATT 2160 

CGCAGGCATA TAAAACCAGA CTCTCTCTGT TTACCTCCTC AGACACCACC CATGGAAGAT 2220 

AACACAGACA TAACGGATTT CATTCATCAA AGCCTACAGC AAAATGATGT AGATCCAACT 2280 

GCCCCACCAA TCGATTCACT GGCCACATAT GCCTAOGAAG GGAGTGGCTC CGTGGCAGAG 2340 

TCCCTCAGCT CTATACACTC TCTCACCACA GAAGCCGACC AGGACTATGA CTATCTGACA 2400 

GACTCGGGAC CCCGCTTTAA AGTCTTGGCA GACATGTTTG GOGAAGAAGA GAGTTATAAC 2460 
CCTGATAAAG TCACTTAAGC GAGTCGTCGA GGCTAAAATA CAACCGAGAG GGGAGATTTT 2520 
T 2521 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID K0:60: 

Met Leu Thr Arg Asn Cys Leu Ser Leu Leu Leu Trp Val Leu Phe Asp 
1 5 10 15 

Gly Gly Leu Leu Thr Pro Leu Gin Pro Gin Pro Gin Gin Thr Leu Ala 
10 20 25 30 

Thr Glu Pro Arg Glu Asn Val He Hie Leu Pro Gly Gin Arg Ser His 
35 40 45 



Phe Gin Arg Val Lye Arg Gly Trp Val Trp Asn Gin Phe Phe Val Leu 
50 55 60 

Glu Glu Tyr Val Gly Ser Glu Pro Gin Tyr Val Gly Lye Leu Hie Ser 
65 70 75 80 

Asp Leu Asp Lys Gly Glu Gly Thr Val Lys Tyr Thr Leu Ser Gly Asp 
85 90 95 

Gly Ala Gly Thr Val Phe Thr He Asp Glu Thr Thr Gly Asp lie His 
100 105 HO 

Ala He Arg Ser Leu Asp Arg Glu Glu Lys Pro Phe Tyr Thr Leu Arg 
115 120 125 

Ala Gin Ala Val Asp He Glu Thr Arg Lys Pro Leu Glu Pro Glu Ser 
130 135 140 

Glu Phe He He Lys Val Gin Asp lie Asn Asp Asn Glu Pro Lys Phe 
145 150 155 160 

Leu Asp Gly Pro Tyr Val Ala Thr Val Pro Glu Met Ser Pro Val Gly 
165 170 175 

Ala Tyr Val Leu Gin Val Lys Ala Thr Asp Ala Asp Asp Pro Thr Tyr 
180 185 190 

35 Gly Asn Ser Ala Arg Val Val Tyr Ser He Leu Gin Gly Gin Pro Tyr 

195 200 205 

Phe Ser He Asp Pro Lye Thr Gly Val He Arg Thr Ala Leu Pro Asn 
210 215 220 

40 Met Asp Arg Glu Val Lye Glu Gin Tyr Gin Val Leu He Gin Ala Lys 

225 230 235 240 

Asp Met Gly Gly Gin Leu Gly Gly Leu Ala Gly Thr Thr He Val Asn 
245 250 255 
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He Thr Leu Thr Asp Val Asn Asp Asn Pro Pro Arg Phe Pro Lys Ser 
260 265 270 

He Phe His Leu Lys Val Pro Glu Ser Ser Pro He Gly Ser Gly He 
275 2B0 285 

Gly Arg He Arg Ala Val Asp Pro Asp Phe Gly Gin Asn Ala Glu He 

50 290 295 300 

Glu Tyr Asn He Val Pro Gly Asp Gly Gly Asn Leu Phe Asp He Val 
305 310 315 320 
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Thr Asp Clu Asp Thr Cln Glu Gly Val He Lye Leu Lys Lye Pro Leu 
325 330 335 

Asp Phe Glu Thr Lys Lys Ala Tyr Thr Phe Lys Val Glu Ala Ser Ann 
340 345 350 

Leu Hie Leu Aop Hia Arg Phe Hia Ser Ala Gly Pro Phe Lye Asp Thr 
355 360 365 

Ala Thr Val Lys He Ser Val Leu Asp Val Asp Glu Pro Pro Val Phe 

370 375 380 

Ser Lys Pro Leu Tyr Thr Met Glu Val Tyr Glu Asp Thr Pro Val Gly 
385 390 395 400 

Thr lie He Gly Ala Val Thr Ala Gin Asp Leu Asp Val Gly Ser Gly 
405 410 415 

Ala Val Arg Tyr Phe He Asp Trp Lys Ser Asp Gly Asp Ser Tyr Phe 
420 425 430 

Thr He Asp Gly Asn Glu Cly Thr lie Ala Thr Asn Glu Leu Leu Asp 
435 440 445 

Arg Glu Ser Thr Ala Gin Tyr Asn Phe Ser He He Ala Ser LyB Val 
450 455 460 

Ser Asn Pro Leu Leu Thr Ser Lys Val Asn He Leu He Asn Val Leu 
465 470 475 480 

Asp Val Asn Glu Phe Pro Pro Clu He Ser Val Pro Tyr Glu Thr Ala 
485 490 495 

Val Cyo Glu Asn Ala Lys Pro Gly Gin He He Gin He Val Ser Ala 
500 505 510 

Ala Asp Arg Asp Leu Ser Pro Ala Gly Gin Gin Phe Ser Phe Arg Leu 
515 520 525 

Ser Pro Glu Ala Ala He Lys Pro Asn Phe Thr Val Arg Asp Phe Arg 
530 535 540 

Asn Asn Thr Ala Gly He Glu Thr Arg Arg Asn Gly Tyr Ser Arg Arg 
545 550 555 560 

Gin Gin Glu Leu Tyr Phe Leu Pro Val Val He Glu Asp Ser Ser Tyr 
565 570 575 

Pro Val Cln Ser Ser Thr Asn Thr Ket Thr He Arg Val Cys Arg Cys 
580 585 590 

Asp Ser Asp Gly Thr He Leu Ser Cys Asn Val Glu Ala He Phe Leu 
595 600 605 

Pro Val Gly Leu Ser Thr Cly Ala Leu He Ala He Leu Leu Cys He 
610 615 620 

Val He Leu Leu Ala He Val Val Leu Tyr Val Ala Leu Arg Arg Gin 
625 630 635 640 

Lys Lys Lys His Thr Leu Met Thr Ser Lys Glu Asp He Arg Asp Asn 
* 645 650 655 

Val He His Tyr Asp Asp Glu Gly Gly Gly Glu Glu Asp Thr Gin Ala 
660 665 670 
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Phe Asp lie Gly Ala Leu Arg Asn Pro Lys Val lie Clu Clu Asn Lye 
675 680 665 

lie Arg Arg Asp lie Lys Pro Asp Ser Leu Cye Leu Pro Arg Cln Arg 
690 695 700 

Pro Pro Met Clu Asp Asn Thr Asp He Arg Asp Phe He His Gin Arg 
705 710 715 720 

Leu Gin Glu Asn Asp Val Asp Pro Thr Ala Pro Pro He Asp Ser Leu 
725 730 735 

Ala Thr Tyr Ala Tyr Glu Gly Ser Gly Ser Val Ala Glu Ser Leu Ser 
740 7<5 750 

Ser lie Asp Ser Leu Thr Thr Glu Ala Asp Gin Asp Tyr Asp Tyr Leu 
755 760 765 

Thr Asp Trp Gly Pro Arg Phe Lys Val Val Ala Asp Met Phe Gly Clu 
770 775 780 

Clu Glu Ser Tyr Asn Pro Asp Lys Val Thr 
785 790 

(2) INFORMATION FOR SEQ ID NO: 61: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION : SEQ ID KO:61: 

CTTCAAGCTT TTGCTGACTC AGTCTGCTAG TCAGAGTCTC CAGGAGAAGA CAGTTCAAGG 60 

CAGGGCCTGG AGCATTGCAT CAGTTTAGGG ACAGGTCAAA GGCTCGCTTA GAGACCTTAG 120 

AGGCAGGTTG CTTCGGTCGT TCAATGCTAG TCTGCTCCTG AGAGCCCTTT TCTCTGGCAA 180 

35 CTGTGGACTC AGAGCTAACC AATTCTACTT GGCAGTGGGG GTGAAGGGTG ATCCAGAGGC 240 

CTGAGCTGCA GACGGCACAA GAGAGAAAAG ATGTCTTAGA AACAGCTTTG ACAACATGCC 300 

TTGGCT C CTG GCAGGGACCT TGGATGGCGT AGTCTACACC CGGAAGTGCC TGCCTGCCAT 360 

40 CCTCTACTGG CTGCCTTGCA AAATATGCTC ACTCGAGCCC CGTGCATGAA TGAAAACCCC 420 

GCOGGGCGCT TCTAGTCGGA CAAAATGCAG COGAGAACTC CGCTCGTTCT GTGCGTTCTC 480 

CTGTCCCAGC TGCTGCTGCT AACATCTCCA GAAGAXTTGG ACTGCACTCC TGGATTTCAG 540 

CAGAAAGTCT TCCATATCAA TCAGCCAGCT GAATTCATTG AGGACCAGTC AATTCTAAAC 600 

TTGACCTTCA GTGACTGTAA GGCAAACGAC AAGCTAOGCT ATGAGGTCTC GAGCCCATAC 660 

TTCAAGGTGA ACAGCGATGG CGGCTTAGTT CCTCIGAGAA ACATAACTCC AGTGGGCAAA 720 

ACTCTGTTCG TCCATGCAOG GACCCCCCAT GCGGAACATA TGGCAGAACT CGTGATTGTC 780 

GGGGGGAAAC ACATCCAGGG CTCCTTGCAG GAXAtATTTA AATTTGCAAG AACTTCTCCT 840 

GTCCCAAGAC AAAACAGGTC CATTGTGCTA TCTOC CA TTT TAATTCCAGA GAATCAGACA 900 
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CAGCCTTTCC CAACAGATGT TCCCAACCTA CTCGATAGTG ACAGGCCACA AACCTCCAAG 
TTCCGGCTCA CTGGAAAGGG AGTGGATCAA CACCCTAAAC CAATTTTCAC AATCAATCAG 
AACACAGCGA GOCTCTCOCT CACACGGACC TTGGACAGAG AAGTAATCGC TGTTTATCAA 
CTATTTGTGG AGACCACTGA TGTCAATGGC AAAACTCTCG AGGCGCCGGT GCCTCTGGAA 
GTCATTGTGA TTCATCAGAA TCACAACCGA CCGATCTTTC CCGAAGGCCC CTACATCGGC 
CACGTCATGG AAGGGTCACC CACAGCCACC ACAGTGATGC CCATCACAGC CTTTGATGCA 
GATGACCCAG CCACCGATAA TGCCCTCCTC CGCTATAATA TCCGTCAACA GACGCCTCAC 
AAGCCATCTC CCAACATGTT CTACATCGAT CCTCAGAAAG GAGACATTGT CACTGTTGTG 
TCACCTGCCC TCCTCGACCG AGAGACTCTG GAAAATCCCA AGTATGAACT GATCATCGAC 
GCTCAAGATA TGGCTGGACT CGATGTTGGA TTAACACCCA CGGCCACACC CACGATCATG 
ATCGATGACA AAAATGATCA CTCACCAAAA TTCACCAACA AAGACTTTCA ACCCACAGTC 
CAGGAAGGAG CTGTGGGAGT TATTGTCAAT TTGACAGTTG AAGATAAGCA TCACCCCACC 
ACAGCTGCAT GGAGCCCTGC CTACACCATC ATCAACCGAA ACCCCGGCCA GACCTTTGAA 
ATCCACACCA ACCCTCAAAC CAACCAACGG ATCCTTTCTG TTGTCAAACC ATTCGACTAT 
GAAATTTCTG CCTTCCACAC CCTGCTGATC AAAGTGGAAA ATGAAGACCC ACTCGTACCC 
GACCTCTCCT ACGGCCCCAG CTCCACAGCC ACCGTCCACA TCACTGTCCT CCATCTCAAC 
GAGCQCCCAG TCTTCTACCC AGACCCCATG ATCGTGACCA CCCAGGACGA CCTCTCTCTG 
GGCAGCCTGC TGCTGACAGT GAATCCCACG GACCCCGACT CXXTGCAGCA TCAAACCATC 
AGGTATTCTG TTTACAAGGA CCCACCAGGT TGGCTGAATA TTAACCCCAT CAATGGGACT 
GTTGACACCA CAGCTGTCCT GGACCCTGAG TCCCCATTTG TCGACAACAG OGTGTACACT 
GCTCTCTTCC TGGCAATTGA CAGTGGCAAC CCTCCOCCTA OGGGCACTGG GACTTTCCTG 
ATAACCCTCG AGCACGTGAA TGACAATCCC CCGTTCATTT ACCCCACAGT AGCTGAAGTC 
TGTGATGATG CCAAAAACCT CAGTGTAGTC ATTTTGGGAG CATCAGATAA CCATCTTCAC 
CCGAATACAG ATCCTTTCAA ATTTGAAATC CACAAACAAG CTGTTCCTGA TAAAGTCTCG 
AAGATCTCCA AGATCAACAA TACACACCCC CTGGTAAGCC TTCTTCAAAA TCTGAACAAA 
GCAAACTACA AOCTCCCCAT CATGGTGACA GATTCAGGGA AACCACCCAT GACGAATATC 
ACAGATCTCA GGGTACAAGT GTCCTCCTGC AGGAATTCCA AAGTGGACTG CAACCCGGCC 
GGGGCCCTGC GCTTCAGCCT GCCCTCAGTC CTCCTCCTCA GCCTCTTCAG CTTAGCTTGT 
CTCTGAGAAC TCCTGAOGTC TGAAGCTTGA CTCCCAAGTT TCCATAGCAA CACGAAAAAA 
AAAAAATCTA TCCAAATCTG AAGATTGCGG TTTACAGCTA TCGAACTTCG 
(2) INFORMATION FOR SEQ ID HQs 62: 

(i) SEQUENCE CHARACTERISTICS x 

(A) LENGTHS 713 amino acids 

(B) TTPEt amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Gin Pro Arg Thr Pro Leu Val Leu eye Val Leu Leu ser Gin Val 
15 10 16 

Leu Leu Leu Thr Ser Ala Glu Asp Leu Asp Cys Thr Pro Gly Phe Gin 

10 20 25 30 

Gin Lyo Val Phe His lie Aon Gin Pro Ala Glu Phe He Glu Asp Gin 
35 40 45 



Ser He Leu Asn Leu Thr Phe Ser Asp Cys Lys Gly Asn Asp Lys Leu 
50 55 60 

Arg Tyr Glu Val Ser Ser Pro Tyr Phe Lys Val Asn Ser Asp Gly Cly 
65 70 75 80 

Leu Val Ala Leu Arg Asn He Thr Ala Val Gly Lys Thr Leu Phe Val 
85 90 95 

His Ala Arg Thr Pro His Ala Glu Asp Met Ala Glu Leu Val He Val 
100 105 110 

Gly Gly Lys Asp He Gin Gly Ser Leu Gin Asp He Phe Lys Phe Ala 
115 120 125 

Arg Thr Ser Pro Val Pro Arg Gin Lys Arg Ser He Val Val Ser Pro 
130 135 140 

He Leu He Pro Glu Asn Gin Arg Gin Pro Phe Pro Arg Asp Val Gly 
145 150 155 160 

Lys Val Val Asp Ser Asp Arg Pro Glu Arg Ser Lys Phe Arg Leu Thr 
165 170 175 

Gly Lys Gly Val Asp Gin Glu Pro Lys Gly He Phe Arg He Asn Glu 
180 165 190 

35 Asn Tbr Gly Ser Val Ser Val Thr Arg Thr Leu Asp Arg Glu Val He 

195 200 205 

Ala Val Tyr Gin Leu Phe Val Glu Thr Thr Asp Val Asn Gly Lyo Thr 
210 215 220 

4Q Leu Glu Gly Pro Val Pro Leu Glu Val He Val He Asp Gin Asn Aop 

225 230 235 240 

Aon Arg Pro He Phe Arg Glu Gly Pro Tyr He Gly His Val Met Glu 
245 250 255 
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Cly Ser Pro Thr Gly Thr Thr Val Met Arg Met Thr Ala Phe Aop Ala 
260 265 270 

Asp Asp Pro Ala Thr Asp Asn Ala Leu Leu Arg Tyr Asn He Arg Gin 
275 280 285 

Gin Thr Pro Asp Lys Pro Ser Pro Asn Met Phe Tyr He Asp Pro Glu 
290 ' 295 300 

Lys Gly Asp He Val Thr Val Val Ser Pro Ala Leu Leu Asp Arg Glu 
305 310 315 320 
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Thr Leu Clu Asn Pro Lys Tyr Clu Leu He lie Clu Ala Gin Asp Met 
325 330 335 

Ala Cly Leu Asp Val Cly Leu Thr Gly Thr Ala Thr Ala Thr He Met 
340 * 345 350 

He Asp Asp Lys Asn Asp His Ser Pro Lys Phe Thr Lys Lys Glu Phe 
355 360 365 

Gin Ala Thr Val Glu Glu Gly Ala Val Gly Val He Val Asn Leu Thr 
370 375 380 

Val Glu Asp Lys Asp Asp Pro Thr Thr Gly Ala Trp Arg Ala Ala Tyr 
385 390 395 400 

Thr He He Asn Gly Asn Pro Gly Gin Ser Phe Glu He His Thr Asn 
405 410 415 

Pro Gin Thr Asn Glu Gly Met Leu Ser Val Val Lys Pro Leu Asp Tyr 
420 42S 430 

Glu He Ser Ala Phe His Thr Leu Leu He Lys Val Glu Asn Glu Asp 
435 440 445 

Pro Leu Val Pro Asp Val Ser Tyr Cly Pro Ser Ser Thr Ala Thr Val 
450 455 460 

His He Thr Val Leu Asp Val Asn Clu Gly Pro Val Phe Tyr Pro Asp 
465 470 475 480 

Pro Hot Met Val Thr Arg Gin Glu Asp Leu Ser Val Gly Ser Val Leu 
465 490 495 

L#u Thr Val Asn Ala Thr Asp Pro Asp Ser Leu Gin His Gin Thr He 
S00 505 510 

30 Arg Tyr Ser Val Tyr Lys Asp Pro Ala Gly Trp Leu Asn He Asn Pro 

515 520 525 

He Asn Cly Thr Val Asp Thr Thr Ala Val Leu Asp Arg Glu Ser Pro 
530 535 540 

35 Phe Val Asp Asn Ser Val Tyr Thr Ala Leu Phe Leu Ala He Asp Ser 

545 ^ 550 555 560 

Cly Asn Pro Pro Ala Thr Gly Thr Gly Thr Leu Leu He Thr Leu Glu 
565 570 575 

Asp Val Asn Asp Asn Ala Pro Phe He Tyr Pro Thr Val Ala Glu Val 

40 580 585 590 

Cys Asp Asp Ala Lys Asn Leu Ser Val Val He Leu Gly Ala Ser Asp 

595 600 605 

Lys Asp Leu His Pro Asn Thr Asp Pro Phe Lys Phe Glu He His Lys 

45 610 615 620 

Gin Ala Val Pro Asp Lys Val Trp Lys He Ser Lys He Asn Asn Thr 
625 630 635 640 
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His Ala Leu Val Ser Leu Leu Gin Asn Leu Asn Lys Ala Asn Tyr Asn 
645 650 655 

Leu Pro He Met Val Thr Asp Ser Cly Lys Pro Pro Met Thr Asn He 
660 665 670 
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Thr Asp Leu Arg VaX Gin Val Cye Ser Cys Arg Ann Ser Lye Val Asp 
675 680 685 

5 Cys Aen Ala Ala Gly Ala Leu Arg Phe Ser Leu Pro Ser Val lie Leu 

690 695 700 

Leu Ser Leu Phe Ser Leu Ala Cys Leu 

705 710 



Claims 

75 

1. A purified and isolated polynucleotide encoding a cadherin selected from the group consisting of cadherin-6, cad- 
herin-7, cadherin-9 and cadherin-10. 

2. The polynucleotide of claim 1 which is a DNA sequence. 

20 

3. The polynucleotide of claim 2 which is a cDNA sequence or biological replica thereof. 

4. The polynucleotide of claim 3 which is SEQ ID NO: 51 . 
25 5. The polynucleotide of claim 3 which is SEQ ID NO: 15. 

6. The polynucleotide of claim 3 which is SEQ ID NO: 19 or SEQ ID NO: 33. 

7. The polynucleotide of claim 3 which is SEQ ID NO: 55. 

30 

8. The polynucleotide of claim 2 which is a genomic DNA or a biological replica thereof. 

9. The DNA of claim 2 which is a wholly or partially chemically synthesized DNA or a biological replica thereof. 
35 10. A biologically functional DNA vector comprising a DNA according to claim 2. 

11. The vector of claim 10 wherein said DNA is operatively linked to an expression control DNA sequence. 

12. A host cell stably transformed or transfected with a DNA according to claim 2 in a manner allowing the expression 
40 in said host cell of the cadherin polypeptide encoded thereby. 

13. A method for producing a cadherin polypeptide comprising the steps of growing a host cell according to claim 12 
in a suitable nutrient medium and isolating the cadherin from said cell or from the medium of its growth. 

45 14. A purified and isolated full length cadherin polypeptide selected from the group consisting of cadherin-6 polypep- 
tide (SEQ ID NO: 52), cadherin-7 polypeptide (SEQ ID NO: 16), cadherin-9 polypeptide (SEQ ID NO: 20 or 34) and 
cadherin-10 polypeptide (SEQ ID NO: 56). 

15. A hybridoma cell line producing a monoclonal antibody specific for a cadherin selected from the group consisting 
so of cadherin-6, cadherin-7, cadherin-9 and cadherin-10. 

16. A hybridoma cell line producing a monoclonal antibody specific for cadherin-5 selected from the group consisting 
of 30Q8A(ATCC HB1 1316), 30Q4H (ATCC HB1 1317), 45A5G (ATCC HB1 1318), 30S2F (ATCC HB1 1319), 45C6A 
(ATCC HB1 1320) and 30T1 1 G (ATCC 1 1 324). 

55 

17. A monoclonal antibody produced by the hybridoma cell line of claim 16. 

18. An antibody substance specif ic for a cadherin selected from the group consisting of cadherin-6, cadherin-7, cad- 
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herin-9 and cadherin-10. 



1 9 A method for modulating the binding capability of a cadherin selected from the group consisting of cadherin-6 cad- 
herin-7, cadherin-9 and cadherin-10 comprising contacting the cadherin with an antibody substance spec.fic for 
said cadherin according to claim 18. 

20 A method for modulating the binding capability of a cadherin selected from the group consisting of cadherin-6, cad- 
* herin-7, cadherin-9 and cadherin-1 0 comprising contacting the cadherin with a polypeptide or pept.de ligand of the 



cadherin 



21 A method for modulating the binding capability of a cadherin selected from the group consisting of cadherin-6, cad- 
herin-7, cadherin-9 and cadherin-10 comprising contacting the cadherin with a peptide of said cadherin. 
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